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FOREWORD 


J.B.S. Haldane titled an anthology of some of his more dys¬ 
peptic writings “Everything Has a History,” and this is clearly 
applicable to the field of medical genetics. More than 200 
years ago, scientists such as Buffon, Lamarck, Goethe, and 
Kielmeyer reflected on how the developmental history of 
each organism relates to the history of life on Earth. Based 
on these ideas, the discipline of biology was born in 18th 
century Europe, enjoyed adolescence as morphology and 
comparative anatomy in the 19th century, and reached 
adulthood in the 20th century as the field of genetics. 
However, the late 19th century definition of genetics (hered¬ 
ity) as the science of variation (and its causes) is still valid. 
Thus, human genetics is the science of human variation, 
medical genetics the science of abnormal human variation, 
and clinical genetics that branch of medicine that cares for 
individuals and families with abnormal variation of struc¬ 
ture and function. 

In the late 19th and early 20th centuries, the unity of 
morphology-based science was gradually replaced by a plu¬ 
ralistic view of biology that splintered the field into many 
different, and often rivalrous, disciplines. However, thanks 
to the application of novel molecular biological methods 
to the analysis of development and to the understanding 
of the materials of heredity (i.e., genes), the various branches 
of biology are being reunited. This new discipline, termed 
“molecular morphology,” may be defined as the study of the 
form, formation, transformation, and malformation of living 
organisms. Indeed, ignorant as they may be of the traditional 
methods of historiography, geneticists have developed their 
own brilliant and highly effective methods. Consequently, 
they have achieved a perspective remarkably longer and much 
better documented than that of historians. This nearly 4- 
billion-year perspective unites living organisms into a single 
web of life related to one another in unbroken descent to a 
common ancestor. This makes the phylogenetic (i.e., the 
genetic relationships of different species to one another) and 
the ontogenetic (i.e., the genetic basis for the development of 
individual organisms) perspectives of development not only 
complementary but inseparable. Thus, it is now possible to 
effectively explore a key question of biology of the 19th and 


20th centuries: What is the relationship between evolution 
and development? 

In 1945 the University of Utah established the Laboratory 
for the Study of Hereditary and Metabolic Disorders (later 
called the Laboratory of Human Genetics). Here, an out¬ 
standing group of scientists performed pioneering studies on 
clefts of lips and palate, muscular dystrophy, albinism, deaf¬ 
ness, hereditary polyposis of the colon (Gardner syndrome), 
and familial breast cancer. These predecessors would be enor¬ 
mously proud of their current peers at the University of Utah, 
whose successes have advanced knowledge in every aspect of 
the field of genetics. 

In their attempts to synthesize the story of genetics and its 
applications to human variability, health and disease, devel¬ 
opment, and cancer, the authors of this text have succeeded 
admirably. This concise, well-written, well-illustrated, and 
carefully edited and indexed book is highly recommended to 
undergraduate students, new graduate students, medical stu¬ 
dents, genetic counseling students, nursing students, and stu¬ 
dents in the allied health sciences. Importantly, it is also a 
wonderful text for practicing physicians (primary care pro¬ 
viders and specialists) who want an authoritative introduc¬ 
tion to the basis and principles of modern genetics as applied 
to human health and development. This text, written by dis¬ 
tinguished and internationally respected colleagues and 
friends who love to teach, is a joy to read in its expression 
of enthusiasm and of wonder, which Aristotle said was the 
beginning of all knowledge. 

Einstein once said, “The most incomprehensible thing 
about the world is that it is comprehensible.” When 1 began 
to work in the field of medical genetics, the gene was widely 
viewed as incomprehensible. Indeed, some scientists, such as 
Goldschmidt, cast doubt on the very existence of the gene, 
although the great American biologist E.B. Wilson had pre¬ 
dicted its chemical nature more than 100 years previously. In 
this text, genes and their function in health and disease are 
made comprehensible in a manner that should have wide 
appeal to all. 

John Opitz, MD 

Salt Lake City, Utah 
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PREFACE 


Medical genetics is a rapidly progressing field. No textbook 
can remain factually current for long, so we have attempted 
to emphasize the central principles of genetics and their clini¬ 
cal application. In particular, this textbook integrates recent 
developments in molecular genetics and genomics with clini¬ 
cal practice. 

This new edition maintains the format and presentation 
that were well received in four previous editions. Basic prin¬ 
ciples of molecular biology are introduced early in the book 
so that they can be discussed and applied in subsequent chap¬ 
ters. The chapters on autosomal and X-linked disorders 
include updated discussions of topics such as genomic 
imprinting, anticipation, and expanded trinucleotide repeats. 
The chapter on cytogenetics highlights important advances 
in this area, including array comparative genomic hybridiza¬ 
tion and newly described microdeletion syndromes. Whole- 
genome DNA sequencing and disease-gene identification, 
which constitute a central focus of modern medical genetics, 
are treated at length. Chapters are included on the rapidly 
developing fields of immunogenetics and cancer genetics. 
Considerable discussion is devoted to the genetics of common 
adult diseases, such as heart disease, diabetes, stroke, and 
hypertension. The book concludes with chapters on genetic 
diagnosis (again emphasizing current approaches such as 
whole-exome and whole-genome sequencing), gene therapy, 
personalized medicine, and clinical genetics and genetic 
counseling. 

As in previous editions, a website is available to provide 
access to continually changing information in medical genet¬ 
ics (http://evolve.elsevier.com/Jorde/). The website includes 
downloadable versions of all of the figures in the textbook, 
hyperlinks to other relevant sites, and a battery of test ques¬ 
tions and answers. 


Several pedagogical aids are incorporated in this book: 

• Clinical Commentary boxes present detailed coverage of 
the most important genetic diseases and provide examples 
of modern clinical management. 

• Mini-summaries, highlighted in gold, are placed on nearly 
every page to help the reader understand and summarize 
important concepts. 

• Study questions, provided at the end of each chapter, assist 
the reader in review and comprehension. 

• A detailed glossary is included at the end of the hook. 

• Key terms are emphasized in boldface. 

• Important up-to-date suggested readings are listed at the 
end of each chapter. 

In addition, the fifth edition retains these important 
features: 

• All chapters have been thoroughly updated, with special 
attention given to rapidly changing topics such as high- 
throughput DNA sequencing, genetic diagnosis, gene 
therapy, cancer genetics, and the genetics of other common 
diseases. 

• To facilitate the creation of illustrations for teaching pur¬ 
poses, all images on the website (including line drawings 
from the textbook) can be downloaded. 

• An expanded comprehensive index includes all text cita¬ 
tions of all diseases. 

This textbook evolved from courses we teach for medical 
students, nursing students, genetic counseling students, and 
graduate and undergraduate students in human genetics. 
These students are the primary audience for this hook, but it 
should also be useful for house staff, physicians, and other 
health-care professionals who wish to become more familiar 
with medical genetics. 
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CHAPTER 


Background and History 


Genetics is playing an increasingly important role in the prac¬ 
tice of clinical medicine. Medical genetics, once largely con¬ 
fined to relatively rare conditions seen by only a few specialists, 
is now becoming a central component of our understanding 
of most major diseases. These include not only the pediatric 
diseases but also common adult diseases such as heart disease, 
diabetes, many cancers, and many psychiatric disorders. 
Because all components of the human body are influenced 
by genes, genetic disease is relevant to all medical specialties. 
Today’s health-care practitioners must understand the science 
of medical genetics. 

WHAT IS MEDICAL GENETICS? 

Medical genetics involves any application of genetics to 
medical practice. It thus includes studies of the inheritance 
of diseases in families, mapping of disease genes to specific 
locations on chromosomes, analyses of the molecular mecha¬ 
nisms through which genes cause disease, and the diagnosis 
and treatment of genetic disease. As a result of rapid progress 
in molecular genetics, DNA-based diagnosis is available for 
several thousand inherited conditions, and gene therapy— 
the insertion of normal genes into patients in order to correct 
genetic disease—is showing promise for some conditions. 
Medical genetics also includes genetic counseling, in which 
information regarding risks, prognoses, and treatments is 
communicated to patients and their families. 

WHY IS A KNOWLEDGE OF MEDICAL 
GENETICS IMPORTANT FOR TODAY'S 
HEALTH-CARE PRACTITIONER? 

There are several reasons health-care practitioners must 
understand medical genetics. Genetic diseases make up a 
large percentage of the total disease burden in pediatric and 
adult populations (Table 1-1). This percentage will continue 
to grow as our understanding of the genetic basis of disease 
grows. In addition, modern medicine is placing increasing 
emphasis on prevention. Because genetics provides a basis for 
understanding the fundamental biological makeup of the 
organism, it naturally leads to a better understanding of the 
disease process. In some cases, this knowledge can lead to 
prevention of the disorder. It also leads to more effective 
disease treatment. Prevention and effective treatment are 
among the highest goals of medicine. The chapters that follow 


provide many examples of the ways genetics contributes to 
these goals. But first, this chapter reviews the foundations 
upon which current practice is built. 

A BRIEF HISTORY 

The inheritance of physical traits has been a subject of curios¬ 
ity and interest for thousands of years. The ancient Hebrews 
and Greeks, as well as later medieval scholars, described many 
genetic phenomena and proposed theories to account for 
them. Many of these theories were incorrect. Gregor Mendel 
(Fig. 1-1), an Austrian monk who is usually considered the 
father of genetics, advanced the field significantly by per¬ 
forming a series of cleverly designed experiments on living 
organisms (garden peas). He then used this experimental 
information to formulate a series of fundamental principles 
of heredity. 

Mendel published the results of his experiments in 1865 
in a relatively obscure journal. It is one of the ironies of bio¬ 
logical science that his discoveries, which still form the foun¬ 
dation of genetics, received little recognition for 35 years. 
At about the same time, Charles Darwin formulated his theo¬ 
ries of evolution, and Darwin’s cousin, Francis Gabon, per¬ 
formed an extensive series of family studies (concentrating 
especially on twins) in an effort to understand the influence 
of heredity on various human traits. Neither scientist was 
aware of Mendel’s work. 

Genetics as it is known today is largely the result of research 
performed during the 20th century. Mendel’s principles were 
independently rediscovered in 1900 by three different scien¬ 
tists working in three different countries. This was also the 
year in which Landsteiner discovered the ABO blood group 
system. In 1902, Archibald Garrod described alkaptonuria as 
the first “inborn error of metabolism.” In 1909, Johannsen 
coined the term gene to denote the basic unit of heredity. 

The next several decades were a period of considerable 
experimental and theoretical work. Several organisms, includ¬ 
ing Drosophila melanogaster (fruit flies) and Neurospora crassa 
(bread mold) served as useful experimental systems in which 
to study the actions and interactions of genes. For example, 
H. J. Muller demonstrated the genetic consequences of ion¬ 
izing radiation in the fruit fly. During this period, much of 
the theoretical basis of population genetics was developed by 
three central figures: Ronald Fisher, J. B. S. Haldane, and 
Sewall Wright. In addition, the modes of inheritance of several 
important genetic diseases, including phenylketonuria, sickle 

1 
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CHAPTER 1 


Background and History 


TABLE 1-1 A Partial List of Some Important Genetic Diseases 

DISEASE 

APPROXIMATE PREVALENCE | 

Chromosome Abnormalities 

Down syndrome 

1/700 to 1/1000 

Klinefelter syndrome 

1/1000 males 

Trisomy 13 

1/10,000 

Trisomy 18 

1/6000 

Turner syndrome 

1/2500 to 1/10,000 females 

Single-Gene Disorders 

Adenomatous polyposis coli 

1/6000 

Adult polycystic kidney disease 

1/1000 

ai-Antitrypsin deficiency 

1/2500 to 1/10,000 (whites)* 

Cystic fibrosis 

1/2000 to 1/4000 (whites) 

Duchenne muscular dystrophy 

1/3500 males 

Familial hypercholesterolemia 

1/500 

Fragile X syndrome 

1/4000 males: 1/8000 females 

Hemochromatosis (hereditary) 

1/300 whites are homozygotes: approximately 1/1000 to 1/2000 

Hemophilia A 

are affected 

1/5000 to 1/10,000 males 

Hereditary nonpolyposis colorectal cancer 

Up to 1/200 

Huntington disease 

1/20,000 (whites) 

Marfan syndrome 

1/10,000 to 1/20,000 

Myotonic dystrophy 

1/7000 to 1/20,000 (whites) 

Neurofibromatosis type 1 

1/3000 to 1/5000 

Osteogenesis imperfecta 

1/5000 to 1/10,000 

Phenylketonuria 

1/10,000 to 1/15,000 (whites) 

Retinoblastoma 

1/20,000 

Sickle cell disease 

1/400 to 1/600 blacks* in America: up to 1/50 in central Africa 

Tay-Sachs disease 

1/3000 Ashkenazi Jews 

Thalassemia 

1/50 to 1/100 (South Asian and circum-Mediterranean populations) 

Multifactorial Disorders 

Congenital Malformations 

Cleft lip with or without cleft palate 

1/500 to 1/1000 

Club foot (talipes equinovarus) 

1/1000 

Congenital heart defects 

1/200 to 1/500 

Neural tube defects (spina bifida, anencephaly) 

1/200 to 1/1000 

Pyloric stenosis 

1/300 

Aduit Diseases 

Alcoholism 

1/10 to 1/20 

Alzheimer disease 

1/10 (Americans older than 65 years) 

Bipolar disorder 

1/100 to 1/200 

Cancer (all types) 

1/3 

Diabetes (types 1 and 2) 

1/10 

Heart disease or stroke 

1/3 to 1/5 

Schizophrenia 

1/100 

Mitochondrial Diseases 

Kaerns-Sayre syndrome 

Rare 

Leber hereditary optic neuropathy (LHON) 

Rare 

Mitochondrial encephalopathy, lactic acidosis, and 

Rare 

stroke-like episodes (MELAS) 

Myoclonic epilepsy and ragged red fiber disease (MERRF) 

Rare 


*The term "white" refers to individuals of predominantly European descent: the term "black" refers to individuals of predominantly 
sub-Saharan African descent. These terms are used for convenience: some of the challenges in accurately describing human populations are 
discussed in Chapter 14. 
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FIG 1-1 Gregor Johann Mendel. (From Raven PH, Johnson 
GB. Biology. 3rd ed. St Louis: Mosby; 1992.) 

cell disease, Huntington disease, and cystic fibrosis, were 
established. In 1944, Oswald Avery showed that genes are 
composed of deoxyribonucleic acid (DNA). 

Probably the most significant achievement of the 1950s 
was the specification of the physical structure of DNA by 
James Watson and Francis Crick in 1953. Their seminal paper, 
which was only one page long, formed the basis for what is 
now known as molecular genetics (the study of the structure 
and function of genes at the molecular level). Another sig¬ 
nificant accomplishment in that decade was the correct speci¬ 
fication of the number of human chromosomes. Since the 
early 1920s, it had been thought that humans had 48 chro¬ 
mosomes in each cell. Only in 1956 was the correct number, 
46, finally determined. The ability to count and identify chro¬ 
mosomes led to a flurry of new findings in cytogenetics, 
including the discovery in 1959 that Down syndrome is 
caused by an extra copy of chromosome 21. 

Technological developments since 1960 have brought 
about significant achievements at an ever-increasing rate. The 
most spectacular advances have occurred in the field of 
molecular genetics. Thousands of genes have been mapped 
to specific chromosome locations. The Human Genome 
Project, a large collaborative venture begun in 1990, provided 
a nearly complete human DNA sequence in 2003 (the term 
genome refers to all of the DNA in an organism). Important 
developments in computer technology have helped to deci¬ 
pher the barrage of data being generated by this and related 
projects. In addition to mapping genes, geneticists have pin¬ 
pointed the molecular defects underlying several thousand 
genetic diseases. This research has contributed greatly to our 
understanding of the ways gene defects can cause disease, 
opening paths to more effective treatment and potential 
cures. 

TYPES OF GENETIC DISEASES 

Humans are estimated to have approximately 20,000 to 
25,000 genes. Alterations in these genes, or in combinations 


of them, can produce genetic disorders. These disorders are 
classified into several major groups: 

• Chromosome disorders, in which entire chromosomes 
(or large segments of them) are missing, duplicated, or 
otherwise altered. These disorders include diseases such as 
Down syndrome and Turner syndrome. 

• Disorders in which single genes are altered; these are often 
termed mendelian conditions, or single-gene disorders. 
Well-known examples include cystic fibrosis, sickle cell 
disease, and hemophilia. 

• Multifactorial disorders, which result from a combina¬ 
tion of multiple genetic and environmental causes. Many 
birth defects, such as cleft lip and cleft palate, as well as 
many adult disorders, including heart disease and diabe¬ 
tes, belong in this category. 

• Mitochondrial disorders, a relatively small number of 
diseases caused by alterations in the small cytoplasmic 
mitochondrial chromosome. 

Table 1-1 provides some examples of each of these types 
of diseases. 

Of these major classes of diseases, the single-gene disor¬ 
ders have probably received the greatest amount of attention. 
These disorders are classified according to the way they are 
inherited in families: autosomal dominant, autosomal reces¬ 
sive, or X-linked. These modes of inheritance are discussed 
extensively in Chapters 4 and 5. The first edition of McKu- 
sick’s Mendelian Inheritance in Man, published in 1966, listed 
only 1368 autosomal traits and 119 X-linked traits. Today, the 
online version of McKusick’s compendium lists nearly 23,000 
genes and traits, of which nearly 21,000 are autosomal, more 
than 1200 are X-linked, 59 are Y-linked, and 65 are in the 
mitochondrial genome. DNA variants responsible for more 
than 4000 of these traits, most of which are inherited diseases, 
have been identified. With continued advances, these numbers 
are certain to increase. 

Although some genetic disorders, particularly the single¬ 
gene conditions, are strongly determined by genes, many 
others are the result of multiple genetic and nongenetic 
factors. One can therefore think of genetic diseases as lying 
along a continuum (Fig. 1-2), with disorders such as cystic 
fibrosis and Duchenne muscular dystrophy situated at one 
end (strongly determined by genes) and conditions such as 
measles situated at the other end (strongly determined by 
environment). Many of the most prevalent disorders, includ¬ 
ing many birth defects and many common diseases such as 
diabetes, hypertension, heart disease, and cancer, lie some¬ 
where in the middle of the continuum. These diseases are the 
products of varying degrees of genetic and environmental 
influences. 

THE CLINICAL IMPACT OF GENETIC DISEASE 

Less than a century ago, diseases of largely nongenetic causa¬ 
tion (i.e., those caused by malnutrition, unsanitary condi¬ 
tions, and pathogens) accounted for the great majority of 
deaths in children. During the 20th century, however, public 
health vastly improved. As a result, genetic diseases have come 
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Heart disease 
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Infectious disease 




Environmental Genetic 

FIG 1-2 Continuum of disease causation. Some diseases (e.g., cystic fibrosis) are strongly 
determined by genes, whereas others (e.g., infectious diseases) are strongly determined by 
environment. 


TABLE 1 -2 Percentages of Childhood Deaths in United Kingdom Hospitals Attributable to 
Nongenetic and Genetic Causes 


CAUSE 

LONDON 1914 

LONDON 1954 

NEWCASTLE 1966 

EDINBURGH 1976 

Nongenetic* 

All causes 

83.5 

62.5 

58.0 

50.0 

Genetic 

Single gene 

2.0 

12.0 

8.5 

8.9 

Chromosomal 

— 

— 

2.5 

2.9 

Multifactorial 

14.5 

25.5 

31.0 

38.2 


Data from Rimoin DL, Connor JM, Pyeritz RE, Korf BR. Emery and Rimoin's Principles and Practice of Medical Genetics. London: Churchill 
Livingstone: 2007. 

*lnfections, for example. 


to account for an ever-increasing percentage of deaths among 
children in developed countries. For example, the percentage 
of pediatric deaths due to genetic causes in various hospitals 
in the United Kingdom increased from 16.5% in 1914 to 50% 
in 1976 (Table 1-2). 

In addition to contributing to a large fraction of child¬ 
hood deaths, genetic diseases also account for a large share 
of admissions to pediatric hospitals. For example, a survey of 
Seattle hospitals showed that 27% of all pediatric inpatients 
presented with a genetic disorder, and a survey of admis¬ 
sions to a major pediatric hospital in Mexico showed that 
37.8% involved a disease that was either genetic or “partly 
genetic.” It is estimated that mendelian diseases alone 
account for =20% of infant mortality and =10% of pediatric 
hospitalizations. 

Another way to assess the importance of genetic diseases 
is to ask, “What fraction of persons in the population will be 
found to have a genetic disorder?” A variety of factors can 
influence the answer to this question. For example, some 
diseases are found more frequently in certain ethnic groups. 
Cystic fibrosis is especially common among those of Euro¬ 
pean heritage, whereas sickle cell disease is especially common 
among those of African descent. Some diseases are more 
common in older persons. For example, colon cancer, breast 
cancer, and Alzheimer disease are caused by dominant genes 
in a small fraction (5-10%) of cases but are not usually mani¬ 
fested until later in life. The prevalence estimates for these 


TABLE 1-3 Approximate Prevalence of 
Genetic Disease in the General Population 

TYPE OF GENETIC DISEASE 

LIFETIME PREVALENCE 
PER 1000 PERSONS 

Autosomal dominant 

3-9.5 

Autosomal recessive 

2-2.5 

X-l inked 

0.5-2 

Chromosome disorder 

6-9 

Congenital malformation* 

20-50 

Total 

31.5-73 


*Congenital means "present at birth." Most congenital 
malformations are thought to be multifactorlal and therefore 
probably have both genetic and environmental components. 


genetic diseases would be higher in an older population. 
Variations in diagnostic and recording practices can also 
cause prevalence estimates to vary. Accordingly, the preva¬ 
lence figures shown in Table 1-3 are given as rather broad 
ranges. Keeping these sources of variation in mind, it is 
notable that a recognizable genetic disease will be diagnosed 
in at least 3% to 7% of the population at some point. This 
tabulation does not include most cases of the more common 
adult diseases, such as heart disease, diabetes, and cancer, 
although it is known that these diseases also have genetic 
components. If such diseases are included, the clinical impact 
of genetic disease is considerable indeed. 
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CHAPTER 


Basic Cell Biology: Structure and Function of 
Genes and Chromosomes 


All genetic diseases involve defects at the level of the cell. For 
this reason, one must understand basic cell biology to under¬ 
stand genetic disease. Errors can occur in the replication of 
genetic material or in the translation of genes into proteins. 
Such errors commonly produce single-gene disorders. In 
addition, errors that occur during cell division can lead to 
disorders involving entire chromosomes. To provide the basis 
for understanding these errors and their consequences, this 
chapter focuses on the processes through which genes are 
replicated and translated into proteins, as well as the process 
of cell division. 

In the 19th century, microscopic studies of cells led scien¬ 
tists to suspect that the nucleus of the cell (Fig. 2-1) contains 
the important mechanisms of inheritance. They found that 
chromatin, the substance that gives the nucleus a granular 
appearance, is observable in the nuclei of nondividing cells. 
Just before a cell undergoes division, the chromatin con¬ 
denses to form microscopically observable, threadlike struc¬ 
tures called chromosomes (from the Greek words for “colored 
bodies”). With the rediscovery of Mendel’s breeding experi¬ 
ments at the beginning of the 20th century, it soon became 
apparent that chromosomes contain genes. Genes are trans¬ 
mitted from parent to offspring and are considered the 
basic unit of inheritance. It is through the transmission of 
genes that physical traits such as eye color are inherited in 
families. Diseases can also be transmitted through genetic 
inheritance. 

Physically, genes are composed of deoxyribonucleic acid 
(DNA). DNA provides the genetic blueprint for all proteins 
in the body. Thus, genes ultimately influence all aspects of 
body structure and function. Humans are estimated to have 
20,000 to 25,000 genes (sequences of DNA that code for 
ribonucleic acid [RNA] or proteins). An error (or mutation) 
in one of these genes often leads to a recognizable genetic 
disease. 

Genes, the basic unit of inheritance, are con¬ 
tained in chromosomes and consist of DNA. 

Each human somatic cell (cells other than the gametes, 
or sperm and egg cells) contains 23 pairs of different chro¬ 
mosomes, for a total of 46. One member of each pair is 
derived from the individual’s father, and the other member is 
derived from the mother. One of the chromosome pairs con¬ 
sists of the sex chromosomes. In normal males, the sex chro¬ 
mosomes are a Y chromosome inherited from the father and 


an X chromosome inherited from the mother. Two X chro¬ 
mosomes are found in normal females, one inherited from 
each parent. The other 22 pairs of chromosomes are auto- 
somes. The members of each pair of autosomes are said to 
be homologs, or homologous, because their DNA is very 
similar. The X and Y chromosomes are not homologs of each 
other. 

Somatic cells, having two of each chromosome, are diploid 
cells. Human gametes have the haploid number of chromo¬ 
somes, 23. The diploid number of chromosomes is main¬ 
tained in successive generations of somatic cells by the process 
of mitosis, whereas the haploid number is obtained through 
the process of meiosis. Both of these processes are discussed 
in detail later in this chapter. 

Somatic cells are diploid, having 23 pairs 
of chromosomes (22 pairs of autosomes and 
one pair of sex chromosomes). Gametes are 
haploid and have a total of 23 chromosomes. 


DNA, RNA, AND PROTEINS: HEREDITY 
AT THE MOLECULAR LEVEL _ 

DNA 

Composition and Structure of DNA 

The DNA molecule has three basic components: the pentose 
sugar, deoxyribose; a phosphate group; and four types of 
nitrogenous bases (so named because they can combine with 
hydrogen ions in acidic solutions). Two of the bases, cytosine 
and thymine, are single carbon-nitrogen rings called pyrimi¬ 
dines. The other two bases, adenine and guanine, are double 
carbon-nitrogen rings called purines (Eig. 2-2). The four 
bases are commonly represented by their first letters: C, T, A, 
and G. 

One of the contributions of Watson and Crick in the mid- 
20th century was to demonstrate how these three compo¬ 
nents are physically assembled to form DNA. They proposed 
the now-famous double helix model, in which DNA can be 
envisioned as a twisted ladder with chemical bonds as its 
rungs (Fig. 2-3). The two sides of the ladder are composed of 
the sugar and phosphate components, held together by strong 
phosphodiester bonds. Projecting from each side of the 
ladder, at regular intervals, are the nitrogenous bases. The 
base projecting from one side is bound to the base projecting 
from the other side by relatively weak hydrogen bonds. The 
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FIG 2-1 The anatomy of the cell. (From McCance KL, Huether SE. Pathophysiology: The Biologic 
Basis for Disease in Adults and Children. 7th ed. St. Louis: Elsevier Mosby; 2014.) 


paired nitrogenous bases therefore form the rungs of the 
ladder. 

Figure 2-2 illustrates the chemical bonds between bases 
and shows that the ends of the ladder terminate in either 3' 
or 5'. These labels are derived from the order in which the 
five carbon atoms composing deoxyribose are numbered. 
Each DNA subunit, consisting of one deoxyribose, one phos¬ 
phate group, and one base, is called a nucleotide. 

Different sequences of nucleotide bases (e.g., 
ACCAAGTGC) specify different proteins. Specification of the 
body’s many proteins must require a great deal of genetic 
information. Indeed, each haploid human cell contains 
approximately 3 billion nucleotide pairs, more than enough 
information to specify the composition of all human 
proteins. 



The most important constituents of DNA are 
the four nucleotide bases: adenine, thymine, 
cytosine, and guanine. DNA has a double helix 
structure. 
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FIG 2-2 Chemical structure of the four bases, which shows 
hydrogen bonds between base pairs. Three hydrogen bonds 
are formed between cytosine-guanine pairs, and two bonds 
are formed between adenine-thymine pairs. 


DNA Coiling 

Textbook illustrations usually depict DNA as a double helix 
molecule that continues in a long, straight line. However, if 
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FIG 2-3 The DNA double helix, with sugar-phosphate back¬ 
bone and nitrogenous bases. 


form a spacer element before the next nucleosome complex. 
The nucleosomes in turn form a helical solenoid; each turn 
of the solenoid includes about six nucleosomes. The sole¬ 
noids themselves are organized into chromatin loops, which 
are attached to a protein scaffold. Each of these loops contains 
approximately 100,000 base pairs (bp), or 100 kilobases (kb), 
of DNA. The end result of this coiling and looping is that the 
DNA, at its maximum stage of condensation, is only about 
1/10,000 as long as it would be if it were fully stretched out. 



DNA is a tightly coiled structure. This coiling 
occurs at several levels: the nucleosome, the 
solenoid, and 100-kb loops. 


Replication of DNA 

As cells divide to make copies of themselves, identical copies 
of DNA must be made and incorporated into the new cells. 
This is essential if DNA is to serve as the fundamental genetic 
material. DNA replication begins as the weak hydrogen 
bonds between bases break, producing single DNA strands 
with unpaired bases. The consistent pairing of adenine with 
thymine and guanine with cytosine, known as complemen¬ 
tary base pairing, is the key to accurate replication. The prin¬ 
ciple of complementary base pairing dictates that the unpaired 
base will attract a free nucleotide only if that nucleotide has 
the proper complementary base. For example, a portion of a 
single strand with the base sequence ATTGCT will bond with 
a series of free nucleotides with the bases TAACGA. The 
single strand is said to be a template upon which the comple¬ 
mentary strand is built. When replication is complete, a new 
double-stranded molecule identical to the original is formed 
(Fig. 2-5). 

Several different enzymes are involved in DNA replication. 
One enzyme unwinds the double helix, and another holds the 
strands apart. Still another enzyme, DNA polymerase, travels 
along the single DNA strand, adding free nucleotides to the 
3' end of the new strand. Nucleotides can be added only to 
the 3' end of the strand, so replication always proceeds from 
the 5' to the 3' end. When referring to the orientation of 
sequences along a gene, the 5' direction is termed upstream, 
and the 3' direction is termed downstream. 

In addition to adding new nucleotides, DNA polymerase 
performs part of a proofreading procedure, in which a newly 
added nucleotide is checked to make certain that it is in fact 
complementary to the template base. If it is not, the nucleo¬ 
tide is excised and replaced with a correct complementary 
nucleotide base. This process substantially enhances the accu¬ 
racy of DNA replication. When a DNA replication error is 
not successfully repaired, a mutation has occurred. As will be 
seen in Chapter 3, many such mutations cause genetic 
diseases. 


the DNA in a cell were actually stretched out in this way, it 
would be about 2 meters long. To package aU of this DNA 
into a tiny cell nucleus, it is coiled at several levels. First, the 
DNA is wound around a histone protein core to form a 
nucleosome (Fig. 2-4). About 140 to 150 DNA bases are 
wound around each histone core, and then 20 to 60 bases 


DNA replication is critically dependent on the 
principle of complementary base pairing (A 
with T; C with G). This allows a single strand 
of the double-stranded DNA molecule to form 
a template for the synthesis of a new, comple¬ 
mentary strand. 
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FIG 2-4 Patterns of DNA coiling. DNA is wound around histones to form nucleosomes. These 
are organized Into solenoids, which in turn make up the chromatin loops. 


The rate of DNA replication in humans, about 40 to 50 
nucleotides per second, is comparatively slow. In bacteria the 
rate is much higher, reaching 500 to 1000 nucleotides per 
second. Given that some human chromosomes have as many 
as 250 million nucleotides, replication would be an extraor¬ 
dinarily time-consuming process if it proceeded linearly from 
one end of the chromosome to the other: For a chromosome 


of this size, a single round of replication would take almost 
2 months. Instead, replication begins at many different points 
along the chromosome, termed replication origins. The 
resulting multiple separations of the DNA strands are called 
replication bubbles (Fig. 2-6). By occurring simultaneously 
at many different sites along the chromosome, the replication 
process can proceed much more quickly. 















10 


CHAPTER 2 Basic Cell Biology: Structure and Function of Genes and Chromosomes 



FIG 2-5 DNA replication. The hydrogen bonds between the two original strands are broken, 
allowing the bases in each strand to undergo complementary base pairing with free bases. This 
process, which proceeds in the 5' to 3' direction on each strand, forms two new double strands 
of DNA. 





FIG 2-6 Replication bubbles form at multiple points along the DNA strand, allowing replication 
to proceed more rapidly. 
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Replication bubbles allow DNA replication to 
take place at multiple locations on the chromo¬ 
some, greatly speeding the replication process. 

From Genes to Proteins 

While DNA is formed and replicated in the cell nucleus, 
protein synthesis takes place in the cytoplasm. The informa¬ 
tion contained in DNA must be transported to the cytoplasm 
and then used to dictate the composition of proteins. This 
involves two processes, transcription and translation. Briefly, 
the DNA code is transcribed into messenger RNA, which 
then leaves the nucleus to be translated into proteins. These 
processes, summarized in Figure 2-7, are discussed at length 
later in this chapter. Transcription and translation are both 
mediated by ribonucleic acid (RNA), a type of nucleic acid 


that is chemically similar to DNA. Like DNA, RNA is com¬ 
posed of sugars, phosphate groups, and nitrogenous bases. It 
differs from DNA in that the sugar is ribose instead of deoxy- 
ribose, and uracil rather than thymine is one of the four bases. 
Uracil is structurally similar to thymine, so, like thymine, it 
can pair with adenine. Another difference between RNA and 
DNA is that whereas DNA usually occurs as a double strand, 
RNA usually occurs as a single strand. 

DNA sequences encode proteins through 
the processes of transcription and translation. 
These processes both involve RNA, a single- 
stranded moleculethat is similarto DNA except 
that it has a ribose sugar rather than deoxyri- 
bose and a uracil base rather than thymine. 





FIG 2-7 A summary of the steps leading from DNA to proteins. Replication and transcription 
occur in the cell nucleus. The mRNA is then transported to the cytoplasm, where translation of 
the mRNA into amino acid sequences composing a protein occurs. 
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Transcription 

Transcription is the process by which an RNA sequence is 
formed from a DNA template (Fig. 2-8). The type of RNA 
produced by the transcription process is messenger RNA 
(mRNA). To initiate mRNA transcription, one of the RNA 
polymerase enzymes (RNA polymerase 11) binds to a pro¬ 
moter site on the DNA (a promoter is a nucleotide sequence 
that lies just upstream of a gene). The RNA polymerase then 
puUs a portion of the DNA strands apart from each other, 
exposing unattached DNA bases. One of the two DNA strands 
provides the template for the sequence of mRNA nucleotides. 
Although either DNA strand could in principle serve as the 
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FIG 2-8 Transcription of DNA to mRNA. RNA polymerase II 
proceeds along the DNA strand in the 3' to 5' direction, 
assembling a strand of mRNA nucleotides that is comple¬ 
mentary to the DNA template strand. 


template for mRNA synthesis, only one is chosen to do so in 
a given region of the chromosome. This choice is determined 
by the promoter sequence, which orients the RNA poly¬ 
merase in a specific direction along the DNA sequence. 
Because the mRNA molecule can be synthesized only in the 
5' to 3' direction, the promoter, by specifying directionality, 
determines which DNA strand serves as the template. This 
template DNA strand is also known as the antisense strand. 
RNA polymerase moves in the 3' to 5' direction along the 
DNA template strand, assembling the complementary mRNA 
strand from 5' to 3' (see Fig. 2-8). Because of complementary 
base pairing, the mRNA nucleotide sequence is identical to 
that of the DNA strand that does not serve as the template— 
the sense strand—except for the substitution of uracil for 
thymine. 

Soon after RNA synthesis begins, the 5' end of the growing 
RNA molecule is capped by the addition of a chemically 
modified guanine nucleotide. This 5' cap appears to help 
prevent the RNA molecule from being degraded during syn¬ 
thesis, and later it helps to indicate the starting position for 
translation of the mRNA molecule into protein. Transcrip¬ 
tion continues until a group of bases called a termination 
sequence is reached. Near this point, a series of 100 to 200 
adenine bases are added to the 3' end of the RNA molecule. 
This structure, known as the poly-A tail, may be involved in 
stabilizing the mRNA molecule so that it is not degraded 
when it reaches the cytoplasm. RNA polymerase usually con¬ 
tinues to transcribe DNA for several thousand additional 
bases, but the mRNA bases that are attached after the poly-A 
tail are eventually lost. Finally, the DNA strands and the RNA 
polymerase separate from the RNA strand, leaving a tran¬ 
scribed single mRNA strand. This mRNA molecule is termed 
the primary transcript. 

In some human genes, such as the one that can cause 
Duchenne muscular dystrophy, several different promoters 
exist and are located in different parts of the gene. Thus, 
transcription of the gene can start in different places, result¬ 
ing in the production of somewhat different proteins. This 
allows the same gene sequence to code for variations of a 
protein in different tissues (e.g., muscle tissue versus brain 
tissue). 



In the process of transcription, RNA poly¬ 
merase II recognizes a promoter site near 
the 5' end of a gene on the sense strand and, 
through complementary base pairing, helps to 
produce an mRNA strand from the antisense 
DNA strand. 


Transcription and the Regulation of Gene Expression 

Some genes are transcribed in all cells of the body. These 
housekeeping genes encode products that are required for a 
cell’s maintenance and metabolism. Most genes, however, are 
transcribed only in specific tissues at specific points in time. 
Therefore, in most cells, only a small fraction of genes are 
actively transcribed. This specificity explains why there is a 
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Enhancer 



FIG 2-9 Key elements of transcription control include general (basal) transcription factors and 
specific enhancers and silencers. The activity of enhancers is mediated by activators and coacti¬ 
vators, which are specific transcription factors. (Data from Tjian R. Molecular machines that 
control genes. Sci Am. 1995;272[2]:54-61.) 


large variety of different cell types making different protein 
products, even though almost all cells have exactly the same 
DNA sequence. For example, the globin genes are transcribed 
in the progenitors of red blood cells (where they help to form 
hemoglobin), and the low-density lipoprotein receptor genes 
are transcribed in liver cells. 

Many different proteins participate in the process of tran¬ 
scription. Some of these are required for the transcription of 
all genes, and these are termed general transcription factors. 
Others, labeled specific transcription factors, have more 
specialized roles, activating only certain genes at certain 
stages of development. A key transcriptional element is RNA 
polymerase II, which was described previously. Although this 
enzyme plays a vital role in initiating transcription by binding 
to the promoter region, it cannot locate the promoter region 
on its own. Furthermore, it is incapable of producing signifi¬ 
cant quantities of mRNA by itself Effective transcription 
requires the interaction of a large complex of approximately 
50 different proteins. These include general (basal) transcrip¬ 
tion factors, which bind to RNA polymerase and to specific 
DNA sequences in the promoter region (sequences such as 
TATA and others needed for initiating transcription). The 
general transcription factors allow RNA polymerase to bind 


to the promoter region so that it can function effectively in 
transcription (Fig. 2-9). 

The transcriptional activity of specific genes can be greatly 
increased by interaction with sequences called enhancers, 
which may be located thousands of bases upstream or down¬ 
stream of the gene. Enhancers do not interact directly 
with genes. Instead, they are bound by a class of specific 
transcription factors that are termed activators. Activators 
bind to a second class of specific transcription factors called 
co-activators, which in turn bind to the general transcription 
factor complex described previously (see Fig. 2-9). This chain 
of interactions, from enhancer to activator to co-activator to 
the general transcription complex and finally to the gene 
itself, increases the transcription of specific genes at specific 
points in time. Whereas enhancers help to increase the tran¬ 
scriptional activity of genes, other DNA sequences, known as 
silencers, help to repress the transcription of genes through 
a similar series of interactions. 

Mutations in enhancer, silencer, or promoter sequences, as 
well as mutations in the genes that encode transcription 
factors, can lead to faulty expression of vital genes and con¬ 
sequently to genetic disease. Many examples of such diseases 
are discussed in the following chapters. 
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Transcription factors are required for the tran¬ 
scription of DNA to mRNA. General transcrip¬ 
tion factors are used by all genes, and specific 
transcription factors help to initiate the tran¬ 
scription of genes in specific cell types at spe¬ 
cific points in time. Transcription is also 
regulated by enhancer and silencer sequences, 
which may be located thousands of bases 
away from the transcribed gene. 

The large number and complexity of transcription factors 
allow fine-tuned regulation of gene expression. But how do 
the transcription factors locate specific DNA sequences? This 
is achieved by DNA-binding motifs: configurations in the 
transcription-factor protein that allow it to fit snugly and 
stably into a unique portion of the DNA double helix. Several 
examples of these binding motifs are listed in Table 2-1, and 
Figure 2-10 illustrates the binding of one such motif to DNA. 


Helix-loop-helix protein 



DNA sequence. 


Each major motif contains many variations that allow speci¬ 
ficity in DNA binding. 

An intriguing type of DNA-binding motif is contained in 
the high-mobility group (HMG) class of proteins. These pro¬ 
teins are capable of bending DNA and can facilitate interac¬ 
tions between distantly located enhancers and the appropriate 
basal factors and promoters (see Fig. 2-9). 

Transcription factors contain DNA-binding 
motifs that allow them to interact with specific 
DNA sequences. In some cases, they bend 
DNA so that distant enhancer sequences can 
interact with target genes. 

Gene activity can also be related to patterns of chromatin 
coiling or condensation (chromatin is the combination of 
DNA and the histone proteins around which the DNA is 
wound). Decondensed, or open, chromatin regions, termed 
euchromatin, are typically characterized by histone acetyla¬ 
tion, the attachment of acetyl groups to lysine residues in 
the histones. Acetylation of histones reduces their binding to 
DNA, helping to decondense the chromatin so that it is 
more accessible to transcription factors. Euchromatin is thus 
transcriptionally active. In contrast, heterochromatin is 
usually less acetylated, more condensed, and transcription¬ 
ally inactive. 

Gene expression can also be influenced by microRNAs 
(miRNA), which are small RNA molecules (17-27 nucleo¬ 
tides) that are not translated into proteins. Instead, because 
they are complementary to specific mRNA sequences, they 
can bind to and down-regulate these mRNAs, thus lowering 
their expression levels. MicroRNA variation has been found 
to play important roles in a number of types of cancer (see 
Chapter 11). 




TABLE 2-1 

The Major Classes of DNA-Binding Motifs Found in Transcription Factors 

MOTIF 

DESCRIPTION 

HUMAN DISEASE EXAMPLES 

Helix-turn-helix 

Two a helices are connected by a short chain of 
amino acids, which constitute the turn. The 
carboxyl-terminal helix is a recognition helix 
that binds to the DNA major groove. 

Homeodomain proteins (HOX): mutations in 
human HOXD13 and HOXA13 cause 
synpolydactyly and hand-foot-genital 
syndrome, respectively. 

Helix-loop-helix 

Two a helices (one short and one long) are 
connected by a flexible loop. The loop allows 
the two helices to fold back and interact with 
one another. The helices can bind to DNA or to 
other helix-loop-helix structures. 

Mutations in the TWIST gene cause Saethre- 
Chotzen syndrome (acrocephalosyndactyly 
type III) 

Zinc finger 

Zinc molecules are used to stabilize amino acid 
structures (e.g., a helices, p sheets), with 
binding of the a helix to the DNA major groove. 

BRCA1 (breast cancer gene); WTI (Wilms 
tumor gene); GL13 (Greig syndrome gene); 
vitamin D receptor gene (mutations cause 
rickets) 

Leucine zipper 

Two leucine-rich a helices are held together by 
amino acid side chains. The a helices form a 
Y-shaped structure whose side chains bind to 
the DNA major groove. 

RBI (retinoblastoma gene); JUN and FOS 
oncogenes 

P Sheets 

Side chains extend from the two-stranded p 
sheet to form contacts with the DNA helix. 

TBX family of genes: TBX5 (Holt-Oram 
syndrome); TBX3 (ulnar-mammary syndrome) 
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the primary mRNA transcript to produce a mature mRNA 
transcript. Consensus sequences mark the sites at which 
splicing occurs. 


Heterochromatin, which is highly condensed 
and hypoacetylated, tends to be transcription¬ 
ally inactive, whereas euchromatin, which is 
acetylated and less condensed, tends to be 
transcriptionally active. 

Gene Splicing 

The primary mRNA transcript is exactly complementary to 
the base sequence of the DNA template. In eukaryotes,* an 
important step takes place before this RNA transcript leaves 
the nucleus. Sections of the RNA are removed by nuclear 
enzymes, and the remaining sections are spliced together to 
form the functional mRNA that will migrate to the cyto¬ 
plasm. The excised sequences are called introns, and the 
sequences that are left to code for proteins are called exons 
(Fig. 2-11). Only after gene splicing is completed does the 
mature transcript move out of the nucleus into the cyto¬ 
plasm. Some genes contain alternative splice sites, which 
allow the same primary transcript to be spliced in different 
ways, ultimately producing different protein products from 
the same gene. Errors in gene splicing, like replication errors, 
are a form of mutation that can lead to genetic disease. 


^Eukaryotes are organisms that have a defined cell nucleus, as 
opposed to prokaryotes, which lack a defined nucleus. 


Introns are spliced out of the primary mRNA 
transcript before the mature transcript leaves 
the nucleus. Exons contain the mRNA that 
specifies proteins. 

The Genetic Code 

Proteins are composed of one or more polypeptides, which 
are in turn composed of sequences of amino acids. The body 
contains 20 different types of amino acids, and the amino 
acid sequences that make up polypeptides must in some way 
be designated by the DNA after transcription into mRNA. 

Because there are 20 different amino acids and only four 
different RNA bases, a single base could not be specific for a 
single amino acid. Similarly, specific amino acids could not 
be defined by couplets of bases (e.g., adenine followed by 
guanine, or uracil followed by adenine) because only 16 (4 X 
4) different couplets are possible. If triplet sets of bases are 
translated into amino acids, however, 64 (4 X 4 X 4) combina¬ 
tions can be achieved—more than enough to specify each 
amino acid. Conclusive proof that amino acids are specified 
by these triplets of bases, or codons, was obtained by manu¬ 
facturing synthetic nucleotide sequences and allowing them 
to direct the formation of polypeptides in the laboratory. The 
correspondence between specific codons and amino acids, 
known as the genetic code, is shown in Table 2-2. 

Of the 64 possible codons, three signal the end of a gene 
and are known as stop codons. These are UAA, UGA, and 
UAG. The remaining 61 codons all specify amino acids. This 
means that most amino acids can be specified by more than 
one codon, as Table 2-2 shows. The genetic code is thus said 
to be degenerate. Although a given amino acid may be speci¬ 
fied by more than one codon, each codon can designate only 
one amino acid. 

Individual amino acids, which compose pro¬ 
teins, are encoded by units of three mRNA 
bases, termed codons. There are 64 possible 
codons and only 20 amino acids, so the genetic 
code is degenerate. 

A significant feature of the genetic code is that it is uni¬ 
versal: the great majority of organisms use the same DNA 
codes to specify amino acids. An important exception to this 
rule occurs in mitochondria, cytoplasmic organelles that are 
the sites of cellular respiration (see Fig. 2-1). The mitochon¬ 
dria have their own extranuclear DNA molecules, and several 
codons of mitochondrial DNA encode different amino acids 
than do the same nuclear DNA codons. 

Translation 

Translation is the process in which mRNA provides a tem¬ 
plate for the synthesis of a polypeptide. mRNA cannot, 
however, bind directly to amino acids. Instead, it interacts 
with molecules of transfer RNA (tRNA), which are cloverleaf¬ 
shaped RNA strands of about 80 nucleotides. As Figure 2-12 
illustrates, each tRNA molecule has a site at the 3' end for the 
attachment of a specific amino acid by a covalent bond. At 
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TABLE 2-2 

The Genetic Code 





FIRST POSITION (5' END) 


SECOND POSITION 


THIRD POSITION (3' END) 
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G 
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U 
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Thr 

Lys 

Arg 
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Met 

Thr 

Lys 

Arg 
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Val 
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U 
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Val 

Ala 

Asp 

Gly 

C 

G 


Val 

Ala 

Glu 

Gly 

A 

G 


Val 

Ala 

Glu 

Gly 

G 


Ala, Alanine; Arg, arginine: Asn, asparagine; Asp, aspartic acid; Cys, cysteine; Gin, glutamine; Glu, glutamic acid; Gly, glycine: His, histidine; 
He, isoleucine; Leu, leucine; Lys, lysine: Met, methionine; Phe, phenylalanine; Pro, proline; Ser, serine; Thr, threonine; Trp, tryptophan; Tyr, 
tyrosine; Val, valine. 

*Examples: UUG is translated into leucine: UAA is a stop codon; GGG is translated into glycine. Under some circumstances the UGA codon 
can specify an amino acid called selenocysteine, which is often called the 21st amino acid. 
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Anticodon (pairs 
with codon) 

FIG 2-12 The structure of a tRNA molecule. In two dimen¬ 
sions, the tRNA has a cloverleaf shape. Note the 3' site of 
attachment for an amino acid. The anticodon pairs with a 
complementary mRNA codon. 


the opposite end of the cloverleaf is a sequence of three nucle¬ 
otides called the anticodon, which undergoes complemen¬ 
tary base pairing with an appropriate codon in the mRNA. 
The attached amino acid is then transferred to the polypep¬ 
tide chain being synthesized. In this way, mRNA specifies the 
sequence of amino acids by acting through tRNA. 

The cytoplasmic site of protein synthesis is the ribosome, 
which consists of roughly equal parts of enzymatic proteins 
and ribosomal RNA (rRNA). The function of rRNA is to help 
bind mRNA and tRNA to the ribosome. During translation, 
depicted in Figure 2-13, the ribosome first binds to an initia¬ 
tion site on the mRNA sequence. This site consists of a spe¬ 
cific codon, AUG, which specifies the amino acid methionine 
(this amino acid is usually removed from the polypeptide 
before the completion of polypeptide synthesis). The ribo¬ 
some then binds the tRNA to its surface so that base pairing 
can occur between tRNA and mRNA. The ribosome moves 
along the mRNA sequence, codon by codon, in the 5' to 3' 
direction. As each codon is processed, an amino acid is trans¬ 
lated by the interaction of mRNA and tRNA. 

In this process, the ribosome provides an enzyme that 
catalyzes the formation of covalent peptide bonds between 
the adjacent amino acids, resulting in a growing polypeptide. 
When the ribosome arrives at a stop codon on the mRNA 
sequence, translation and polypeptide formation cease. The 
amino (NH 2 ) terminus of the polypeptide corresponds to the 
5' end of the mRNA strand, and the carboxyl (COOH) ter¬ 
minus corresponds to the 3' end. After synthesis is completed, 
the mRNA, the ribosome, and the polypeptide separate from 
one another. The polypeptide is then released into the 
cytoplasm. 
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FIG 2-13 Translation of mRNA to amino acids. The ribosome moves along the mRNA strand in 
the 5' to 3' direction, assembling a growing polypeptide chain. In this example, the mRNA 
sequence GUG AGC AAG GGU UCA has assembled five amino acids (Val, Ser, Lys, Gly, and 
Ser, respectively) into a polypeptide. 



























































18 


CHAPTER 2 Basic Cell Biology: Structure and Function of Genes and Chromosomes 


In the process of translation, the mRNA 
sequence serves as a template to specify 
sequences of amino acids. These sequences, 
which form polypeptides, are assembled by 
ribosomes. The tRNA and rRNA molecules 
interact with mRNA in the translation process. 

Before a newly synthesized polypeptide can begin its exis¬ 
tence as a functional protein, it often undergoes further 
processing, termed posttranslational modification. These 
modifications can take a variety of forms, including cleav¬ 
age into smaller polypeptide units or combination with 


other polypeptides to form a larger protein. Other possible 
modifications include the addition of carbohydrate side 
chains to the polypeptide. Such modifications may be 
needed, for example, to produce proper folding of the 
mature protein or to stabilize its structure. An example of a 
clinically important protein that undergoes considerable 
posttranslational modification is type I collagen (Clinical 
Commentary 2-1). 

Posttranslational modification consists of 
various chemical changes that occur in pro¬ 
teins shortly after they are translated. 




CLINICAL COMMENTARY 2-1 

Osteogenesis Imperfecta, an Inherited Collagen Disorder 

As its name implies, osteogenesis imperfecta is a disease 
caused by defects in the formation of bone. This disorder, 
sometimes known as brittle bone disease, affects approxi¬ 
mately 1 in 15,000 to 1 in 25,000 births in all ethnic groups. 

Approximately 90% of osteogenesis imperfecta cases are 
caused by defects in type I collagen, a major component of 
bone that provides much of its structural stability. The function 
of collagen in bone is analogous to that of the steel bars incor¬ 
porated in reinforced concrete. 

When type I collagen is improperly formed, the bone loses 
much of its strength and fractures easily. Patients with osteo¬ 
genesis imperfecta can suffer hundreds of bone fractures (Fig. 
2-14), or they might experience only a few, making this disease 
highly variable in its expression (the reasons for this variability 
are discussed in Chapter 4). In addition to bone fractures, 
patients can have short stature, hearing loss, abnormal tooth 
development (dentinogenesis imperfecta), bluish sclerae, and 
various bone deformities, including scoliosis. Osteogenesis 


imperfecta was traditionally classified into four major types, all 
of which are caused by mutations in either of the two genes 
that encode type I collagen. Several additional types, caused by 
mutations in other genes, have subsequently been added 
(Table 2-3). There is currently no cure for this disease, and 
management consists primarily of the repair of fractures and, 
in some cases, the use of external or internal bone support (e.g., 
surgically implanted rods). Additional therapies include the 
administration of bisphosphonates to decrease bone resorption 
and human growth hormone to facilitate growth. Physical reha¬ 
bilitation also plays an important role in clinical management. 

Type I collagen is a trimeric protein (i.e., having three sub¬ 
units) with a triple helix structure. It is formed from a precursor 
protein, type 1 procollagen. Two of the three subunits of type 
1 procollagen, labeled pro-a1 (I) chains, are encoded by an 18-kb 
(kb = 1000 base pairs) gene on chromosome 17, and the third 
subunit, the pro-a2(l) chain, is encoded by a 38-kb gene on 
chromosome 7. Each of these genes contains more than 50 


TABLE 2-3 Subtypes of Osteogenesis Imperfecta 


TYPE DISEASE FEATURES 

I Mild bone fragility, blue sclerae, hearing loss in 60% of patients, normal or near-normal stature, few bone 

deformities, dentinogenesis imperfecta in some cases 

II Most severe form, with extreme bone fragility, long bone deformities, compressed femurs, gray-blue sclerae; lethal 

in the prenatal or perinatal periods (most die of respiratory failure during the first week of life, and survival beyond 
one year is highly unusual) 

III Severe bone fragility, very short stature, variably blue sclerae, progressive bone deformities, hearing loss in 80% of 

patients; dentinogenesis imperfecta is common 

IV Short stature, normal-to-gray sclerae, mild to moderate bone deformity, hearing loss in some patients, 

dentinogenesis imperfecta is common; bone fragility is variable 

V Similar to type IV but also includes calcification of interosseous membrane of forearm, radial head dislocation, and 

hyperplastic callus formation 

VI Similar to type IV but not caused by type I collagen mutations; calcification in interosseous membranes is not seen; 

no dentinogenesis imperfecta 

VII White sclerae, early lower limb deformities, shortening of all limbs, congenital fractures, osteopenia 

VIII Severe, often lethal phenotype similar to type II but not caused by type I collagen mutations; severe osteoporosis, 

shortened long bones, white sclerae 

Types l-IV are caused by mutations in the two genes that encode type I collagen protein; types V-VIII have been identified on the basis of 
distinct bone histology and are caused by mutations in genes involved in posttranslational processing of the gene product. Types VII and 
VIII have an autosomal recessive mode of inheritance, and several additional rare autosomal recessive forms of osteogenesis imperfecta 
have been identified. There is substantial phenotypic overlap among many of these categories of osteogenesis imperfecta. 
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CLINICAL COMMENTARY 2-1 


Osteogenesis Imperfecta, an Inherited Collagen Disorder—cont'd 



FIG 2-14 A, A stillborn infant with type II osteogenesis 
imperfecta (the perinatal lethal form). The infant had a 
type I procollagen mutation and short, slightly twisted 
limbs. B, Radiograph of an infant with type II osteogenesis 
imperfecta. Note calluses from rib fractures, which are 
observable as "beads" on the ribs (arrows). 


exons. After transcription and splicing, the mature mRNA 
formed from each gene is only 5 to 7 kb long. The mature 
mRNAs proceed to the cytoplasm, where they are translated 
into polypeptide chains by the ribosomal machinery of the cell. 

At this point, the polypeptide chains undergo a series of 
posttranslational modifications. Many of the proline and lysine 
residues* are hydroxylated (i.e., hydroxyl groups are added) to 
form hydroxyproline and hydroxylysine, respectively. (Several 
rare forms of osteogenesis imperfect, including type VII, are 
caused by mutations in genes involved in the hydroxylation 
process.) The three polypeptides, two pro-a1 (I) chains, and one 
pro-a2(l) chain, begin to associate with one another at their 
COOH termini. This association is stabilized by sulfide bonds 
that form between the chains near the COOH termini. The 
triple helix then forms, in zipper-like fashion, beginning at the 
COOH terminus and proceeding toward the NH 2 terminus. 
Some of the hydroxylysines are glycosylated (i.e., sugars are 
added), a modification that commonly occurs in the rough 
endoplasmic reticulum (see Fig. 2-1). The hydroxyl groups in 
the hydroxyprolines help to connect the three chains by forming 
hydrogen bonds, which stabilize the triple helix. Critical to 
proper folding of the helix is the presence of a glycine in every 
third position of each polypeptide. 

Once the protein has folded into a triple helix, it moves from 
the endoplasmic reticulum to the Golgi apparatus (see Fig. 2-1) 
and is secreted from the cell. Yet another modification then 
takes place: the procollagen is cleaved by proteases near both 
the NH 2 and the COOH termini of the triple helix, removing 
some amino acids at each end. These amino acids performed 
essential functions earlier in the life of the protein (e.g., helping 
to form the triple helix structure, helping to thread the protein 
through the endoplasmic reticulum) but are no longer needed. 
This cleavage results in the mature protein, type I collagen. The 
collagen then assembles itself into fibrils, which react with 
adjacent molecules outside the cell to form the covalent cross¬ 
links that impart tensile strength to the fibrils. 

The path from the DNA sequence to the mature collagen 
protein involves many steps (Fig. 2-15). The complexity of this 
path provides many opportunities for mistakes (in replication, 
transcription, translation, or posttranslational modification) that 
can cause disease. Of the more than 1500 type I collagen 
mutations now known to cause osteogenesis imperfecta, the 
most common type produces a replacement of glycine with 
another amino acid. Because only glycine is small enough to 
be accommodated in the center of the triple helix structure, 
substitution of a different amino acid causes instability of the 
structure and thus poorly formed fibrils. This type of mutation 
is often seen in severe forms of osteogenesis imperfecta. 
Other mutations can cause excess posttranslational modifica¬ 
tion of the polypeptide chains, again producing abnormal fibrils. 
Other examples of disease-causing mutations are provided in 
the suggested readings at the end of this chapter. 


Continued 
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CLINICAL COMMENTARY 2-1 

Osteogenesis Imperfecta, an Inherited Collagen Disorder—cont'd 



FIG 2-15 The process of collagen fibril formation. After the pro-a polypeptide chain is formed, 
a series of posttranslational modifications takes place, including hydroxylation and glycosylation. 
Three polypeptide chains assemble into a triple helix, which is secreted outside the cell. Portions 
of each end of the procollagen molecule are cleaved, resulting in the mature collagen molecule. 
These molecules then assemble into collagen fibrils. 

*A residue is an amino acid that has been incorporated into a polypeptide chain. 


THE STRUCTURE OF GENES 
AND THE GENOME 

Some aspects of gene structure, such as the existence of 
introns and exons, have already been touched on. Alterations 
of different parts of genes can have quite distinct conse¬ 
quences in terms of genetic disease. It is therefore necessary 
to describe more fully the details of gene structure. A sche¬ 
matic diagram of gene structure is given in Figure 2-16. 

Introns and Exons 

The intron-exon structure of genes, discovered in 1977, is 
one of the attributes that distinguishes eukaryotes from pro¬ 
karyotes. Introns form the major portion of most eukaryotic 


genes. As noted previously, introns are spliced out of the 
mRNA before it leaves the nucleus, and this splicing must be 
under very precise control. Enzymes that carry out splicing 
are directed to the appropriate locations by DNA sequences 
known as consensus sequences (so named because they are 
common in all eukaryotic organisms), which are situated 
adjacent to each exon. 

Because most eukaryotic genes are composed primarily of 
introns, it is natural to ask whether introns might have some 
function. One interesting hypothesis is that introns, by 
lengthening genes, encourage the shuffling of genes when 
homologous chromosomes exchange material during meiosis 
(see later discussion). It has also been suggested that introns 
evolved to modify the amount of time required for DNA 
replication and transcription. 
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FIG 2-16 Details of gene structure, showing promoter and upstream regulation (enhancer) 
sequences and a poly-A addition site. 


Surprisingly, some introns contain transcribed genes 
that are apparently unrelated to the gene in which the 
introns are contained. For example, introns of the human 
neurofibromatosis type 1 (NFl) gene contain three genes that 
are transcribed in the direction opposite that of the NFl gene 
itself These genes appear to have no functional relationship 
to the NFl gene. Similar gene inserts have been found within 
the factor Vlll gene (F8) on the human X chromosome. 

Types of DNA 

Although most of the emphasis in genetics is given to the 
DNA that encodes proteins, only 1% to 2% of the 3 billion 
nucleotide pairs in the human genome actually perform this 
role. It has been estimated recently that about 75% of human 
DNA is transcribed into mRNA at some time in some cells, 
but the function of most of this mRNA (if any) is unknown. 
To better understand the nature of all types of DNA, we 
briefly review the several categories into which it is classified 
(Fig. 2-17). 

The first and most important class of DNA is termed 
single-copy DNA. As the name implies, single-copy DNA 
sequences are seen only once (or possibly a few times) in the 
genome. Single-copy DNA accounts for about half of the 
genome and includes the protein-coding genes. However, 
protein-coding DNA (exons) represents only a small fraction 
of all single-copy DNA, most of which is found in introns or 
in DNA sequences that lie between genes. 

The other half of the genome consists of repetitive DNA, 
sequences that are repeated over and over again in the 
genome, often thousands of times. There are two major 
classes of repetitive DNA: dispersed repetitive DNA and sat¬ 
ellite DNA. Satellite repeats are clustered together in certain 
chromosome locations, where they occur as tandem repeats 
(i.e., the beginning of one repeat occurs immediately adjacent 
to the end of another). Dispersed repeats, as the name implies, 
tend to be scattered singly throughout the genome; they do 
not occur in tandem. 

The term satellite is used because these sequences, owing 
to their composition, can easily be separated by centrifuga¬ 
tion in a cesium chloride density gradient. The DNA appears 
as a satellite, separate from the other DNA in the gradient. 
This term is not to be confused with the satellites that can 
be observed microscopically on certain chromosomes (see 
Chapter 6). Satellite DNA accounts for approximately 8% to 
10% of the genome and can be further subdivided into several 
categories. a-Satellite DNA occurs as tandem repeats of a 
171-bp sequence that can extend to several million base pairs 


Other 

dispersed Exons 



FIG 2-17 Structural composition of the human genome. 
Because of limitations in mapping repetitive sequences, these 
figures are approximate. In addition, there is some overlap 
among categories (e.g., repeat sequences are sometimes 
found in introns). The category "other dispersed repeats" 
includes DNA transposons, LTR (long terminal repeat) ret- 
rotransposons, and low-copy number duplications. 


or longer. This type of satellite DNA is found near the cen¬ 
tromeres of chromosomes. Minisatellites are blocks of 
tandem repeats (each 14 to 500 bp long) whose total length 
is much smaller, usually a few thousand base pairs. A final 
category, microsatellites, are smaller still: the repeat units are 

1 to 13 bp long, and the total length of the array is usually 
less than a few hundred base pairs. Minisatellites and micro¬ 
satellites are of special interest in human genetics because 
they vary in length among individuals, making them highly 
useful for gene mapping (see Chapter 8). A minisatellite or 
microsatellite is found at an average frequency of one per 

2 kb in the human genome; altogether they account for about 
3% of the genome. 

Dispersed repetitive DNA makes up about 45% of the 
genome, and these repeats fall into several major categories 
(Fig. 2-18). The two most common categories are short inter¬ 
spersed elements (SINEs) and long interspersed elements 
(LINEs). Individual SINEs range in size from 90 to 500 bp, 
and individual LINEs can be as large as 7000 bp. One of the 
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FIG 2-18 Single-copy DNA sequences are unique and are dispersed throughout the genome. 
Satellite DNA sequences are repetitive elements that occur together in clusters. Dispersed 
repeats are similar to one another but do not cluster together. 


most important types of SINEs in humans is the 300-bp Alu 
element, so named because each repeat contains a DNA 
sequence that can be cut by the Alu restriction enzyme (see 
Chapter 3 for further discussion). The Alu repeats are a 
family of DNA elements, meaning that all of them have 
highly similar DNA sequences. About 1 million Alu repeats 
are scattered throughout the genome; they thus constitute 
approximately 10% of all human DNA. LINE repeats consti¬ 
tute another 20% of the human genome. A remarkable 
feature of Alu sequences, as well as LINEs, is that some of 
them can generate copies of themselves, which can then be 
inserted into other parts of the genome. This is how they have 
achieved such remarkable numbers. These insertions can 
sometimes interrupt a protein-coding gene, causing genetic 
disease (examples are discussed in Chapter 4). 

There are several major types of DNA, includ¬ 
ing single-copy DNA, satellite DNA, and dis¬ 
persed repetitive DNA. The lattertwo categories 
are both classes of repeated DNA sequences. 
Only about 1% to 2% of human DNA actually 
encodes proteins. 

THE CELL CYCLE 

During the course of development, each human progresses 
from a single-cell zygote (an egg cell fertilized by a sperm 
ceU) to a marvelously complex organism containing approxi¬ 
mately 100 trillion (lO^'*) individual cells. Because few cells 
last for a person’s entire lifetime, new ones must be gener¬ 
ated to replace those that die. Both of these processes— 
development and replacement—require the manufacture of 
new cells. The cell division processes that are responsible for 
the creation of new diploid cells from existing ones are 
mitosis (nuclear division) and cytokinesis (cytoplasmic divi¬ 
sion). Before dividing, a cell must duplicate its contents, 
including its DNA; this occurs during interphase. The alter¬ 
nation of mitosis and interphase is referred to as the cell 
cycle. 

As Figure 2-19 shows, a typical cell spends most of its life 
in interphase. This portion of the cell cycle is divided into 
three phases, Gl, S, and G2. During G1 (gap 1, the interval 
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FIG 2-19 Major phases of the mitotic cell cycle, showing the 
alternation of interphase and mitosis (division). 

between mitosis and the onset of DNA replication), synthesis 
of RNA and proteins takes place. DNA replication occurs 
during the S (synthesis) phase. During G2 (the interval 
between the S phase and the next mitosis), some DNA repair 
takes place, and the cell prepares for mitosis. By the time G2 
has been reached, the cell contains two identical copies of 
each of the 46 chromosomes. These identical chromosomes 
are referred to as sister chromatids. Sister chromatids often 
exchange material during interphase, a process known as 
sister chromatid exchange. 

The cell cycle consists of the alternation of cell 
division (mitosis and cytokinesis) and inter¬ 
phase. DNA replication and protein synthesis 
take place during interphase. 

The length of the cell cycle varies considerably from one 
cell type to another. In rapidly dividing cells such as those of 
epithelial tissue (found, for example, in the lining of the 
intestines and in the lungs), the cycle may be completed in 
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less than 10 hours. Other cells, such as those of the liver, 
might divide only once each year or so. Some cell types, such 
as skeletal muscle cells and neurons, largely lose their ability 
to divide and replicate in adults. Although all stages of the 
cell cycle have some variation in length, the great majority of 
variation is due to differences in the length of the G1 phase. 
When cells stop dividing for a long period, they are often said 
to be in the GO stage. 

Cells divide in response to important internal and exter¬ 
nal cues. Before the cell enters mitosis, for example, DNA 
replication must be accurate and complete and the cell must 
have achieved an appropriate size. The cell must respond to 
extracellular stimuli that require increased or decreased rates 
of division. Complex molecular interactions mediate this 
regulation. Among the most important of the molecules 
involved are the cyclin-dependent kinases (CDKs), a family 
of kinases that phosphorylate other regulatory proteins at 
key stages of the cell cycle. To carry out this function, the 
CDKs must form complexes with various cyclins, proteins 
that are synthesized at specific cell-cycle stages and are then 
degraded when CDK action is no longer needed. The cyclins 
and CDKs, as well as the many proteins that interact with 
them, are subjects of intense study because of their vital role 
in the cell cycle and because their malfunction can lead to 
cancer (see Chapter 11). 

The length of the cell cycle varies in different 
cell types. Critical to regulation of the cell cycle 
are CDKs, which phosphorylate other proteins, 
and cyclins, which form complexes with CDKs. 
Faulty regulation of the cell cycle can lead to 
cancer. 

Mitosis 

Although mitosis usually requires only 1 to 2 hours to com¬ 
plete, this portion of the cell cycle involves many critical and 
complex processes. Mitosis is divided into several phases (Fig. 
2-20). During prophase, the first mitotic stage, the chromo¬ 
somes become visible under a light microscope as they con¬ 
dense and coil (chromosomes are not clearly visible during 
interphase). The two sister chromatids of each chromosome 
lie together, attached at a point called the centromere. The 
nuclear membrane, which surrounds the nucleus, disappears 
during this stage. Spindle fibers begin to form, radiating 
from two centrioles located on opposite sides of the cell. The 
spindle fibers become attached to the centromeres of each 
chromosome and eventually pull the two sister chromatids in 
opposite directions. 

The chromosomes reach their most highly condensed state 
during metaphase, the next stage of mitosis. Because they are 
highly condensed, they are easiest to visualize microscopically 
during this phase. For this reason, clinical diagnosis of chro¬ 
mosome disorders is usually based on metaphase chromo¬ 
somes. During metaphase, the spindle fibers begin to contract 
and pull the centromeres of the chromosomes, which are now 
arranged along the middle of the spindle (the equatorial 
plane of the cell). 


During anaphase, the next mitotic stage, the centromere 
of each chromosome splits, allowing the sister chromatids to 
separate. The chromatids are then pulled by the spindle fibers, 
centromere first, toward opposite sides of the cell. At the end 
of anaphase, the cell contains 92 separate chromosomes, half 
lying near one side of the cell and half near the other side. If 
all has proceeded correctly, the two sets of chromosomes are 
identical. 

Telophase, the final stage of mitosis, is characterized by 
the formation of new nuclear membranes around each of the 
two sets of 46 chromosomes. Also, the spindle fibers disap¬ 
pear, and the chromosomes begin to decondense. Cytokinesis 
usually occurs after nuclear division and results in a roughly 
equal division of the cytoplasm into two parts. With the 
completion of telophase, two diploid daughter cells, both 
identical to the original cell, have been formed. 

Mitosis is the process through which two iden¬ 
tical diploid daughter cells are formed from a 
single diploid cell. 

Meiosis 

When an egg cell and a sperm cell unite to form a zygote, 
their chromosomes are combined into a single cell. Because 
humans are diploid organisms, there must be a mechanism 
to reduce the number of chromosomes in gametes to the 
haploid state. Otherwise the zygote would have 92, instead of 
the normal 46, chromosomes. The primary mechanism by 
which haploid gametes are formed from diploid precursor 
cells is meiosis. 

Two cell divisions occur during meiosis. Each meiotic divi¬ 
sion has been divided into stages with the same names as 
those of mitosis, but the processes involved in some of the 
stages are quite different (Fig. 2-21). During meiosis 1, often 
called the reduction division stage, two haploid cells are 
formed from a diploid cell. These diploid cells are the oogonia 
in females and the spermatogonia in males. After meiosis 1, 
a second meiosis, the equational division, takes place, during 
which each haploid cell is replicated. 

The first stage of the meiotic cell cycle is interphase I, 
during which important processes such as replication of chro¬ 
mosomal DNA take place. The second phase of meiosis 1, 
prophase I, is quite complex and includes many of the key 
events that distinguish meiosis from mitosis. Prophase I 
begins as the chromatin strands coil and condense, causing 
them to become visible as chromosomes. During the process 
of synapsis, the homologous chromosomes pair up, side by 
side, lying together in perfect alignment (in males, the X and 
Y chromosomes, being mostly nonhomologous, line up end 
to end). This pairing of homologous chromosomes is an 
important part of the cell cycle that does not occur in mitosis. 
As prophase I continues, the chromatids of the two chromo¬ 
somes intertwine. Each pair of intertwined homologous chro¬ 
mosomes is either bivalent (indicating two chromosomes in 
the unit) or tetrad (indicating four chromatids in the unit). 

A second key feature of prophase I is the formation of 
chiasmata (plural of chiasma), cross-shaped structures that 









24 


CHAPTER 2 Basic Cell Biology: Structure and Function of Genes and Chromosomes 


Centrioles 


Interphase 



—- Bipolar 
spindle fiber 


Prophase 



Sister 

chromatids 


Daughter cells 


Centromere 
Spindle pole 

Prometaphase 

Microtubule 

Chromatid 



Telophase 


Metaphase 




'Equatorial 

plane 

(metaphase 

plate) 


Anaphase 

FIG 2-20 The stages of mitosis, during which two identical diploid cells are formed from one 
original diploid cell. 


mark attachments between the homologous chromosomes 
(Fig. 2-22). Each chiasma indicates a point at which the 
homologous chromosomes exchange genetic material. This 
process, called crossing over, produces chromosomes that 
consist of combinations of parts of the original chromo¬ 
somes. This chromosomal shuffling is important because it 
greatly increases the possible combinations of genes in each 
gamete and thereby increases the number of possible combi¬ 
nations of human traits. Also, as discussed in Chapter 8, this 
phenomenon is critically important in inferring the order of 
genes along chromosomes. At the end of prophase I, the 
bivalents begin to move toward the equatorial plane, a spindle 


apparatus begins to form in the cytoplasm, and the nuclear 
membrane dissipates. 

Metaphase I is the next phase. As in mitotic metaphase, this 
stage is characterized by the completion of spindle formation 
and alignment of the bivalents, which are still attached at the 
chiasmata, in the equatorial plane. The two centromeres of 
each bivalent now lie on opposite sides of the equatorial plane. 

During anaphase I, the chiasmata disappear and the 
homologous chromosomes are pulled by the spindle fibers 
toward opposite poles of the cell. The key feature of this phase 
is that unlike the corresponding phase of mitosis, the centro¬ 
meres do not duplicate and divide, so that only half of the 
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FIG 2-21 The stages of meiosis, during which haploid gametes are formed from a diploid cell. 
For brevity, prophase II and telophase II are not shown. Note the relationship between meiosis 
and spermatogenesis and oogenesis. 


original number of chromosomes migrates toward each pole. 
The chromosomes migrating toward each pole thus consist 
of one member of each pair of autosomes and one of the sex 
chromosomes. 

The next stage, telophase I, begins when the chromo¬ 
somes reach opposite sides of the cell. The chromosomes 
uncoil slightly, and a new nuclear membrane begins to form. 
The two daughter cells each contain the haploid number of 
chromosomes, and each chromosome has two sister chroma¬ 
tids. In humans, cytokinesis also occurs during this phase. 
The cytoplasm is divided approximately equally between the 
two daughter cells in the gametes formed in males. In those 


formed in females, nearly all of the cytoplasm goes into one 
daughter cell, which will later form the egg. The other daugh¬ 
ter cell becomes a polar body, a small, nonfunctional cell that 
eventually degenerates. 

Meiosis I (reduction division) includes a pro¬ 
phase I stage in which homologous chromo¬ 
somes line up and exchange material (crossing 
over). During anaphase I the centromeres do 
not duplicate and divide. Consequently only 
one member of each pair of chromosomes 
migrates to each daughter cell. 
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Most chromosome disorders are caused by errors that 
occur during meiosis. Gametes can be created that contain 
missing or additional chromosomes or chromosomes with 
altered structures. In addition, mitotic errors that occur early 
in the life of the embryo can affect enough of the body’s cells 
to produce clinically significant disease. Mitotic errors occur¬ 
ring at any point in one’s lifetime can, under some circum¬ 
stances, cause cancer. Cancer genetics is discussed in Chapter 
11, and chromosome disorders are the subject of Chapter 6. 
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chromosomes 
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Result of 
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FIG 2-22 The process of chiasma formation and crossing 
over results in the exchange of genetic material between 
homologous chromosomes. 



The equational division, meiosis II, then begins with inter¬ 
phase II. This is a very brief phase. The important feature of 
interphase II is that, unlike interphase I, no replication of 
DNA occurs. Prophase II, the next stage, is quite similar to 
mitotic prophase, except that the cell nucleus contains only 
the haploid number of chromosomes. During prophase II, the 
chromosomes thicken as they coil, the nuclear membrane 
disappears, and new spindle fibers are formed. This is fol¬ 
lowed by metaphase II, during which the spindle fibers pull 
the chromosomes into alignment at the equatorial plane. 

Anaphase II then follows. This stage resembles mitotic 
anaphase in that the centromeres split and each carries a 
single chromatid toward a pole of the cell. The chromatids 
have now separated, but, because of chiasma formation and 
crossing over, the newly separated sister chromatids might 
not be identical (see Fig. 2-21). 

Telophase II, like telophase I, begins when the chromo¬ 
somes reach opposite poles of the cell. There they begin to 
uncoil. New nuclear membranes are formed around each 
group of chromosomes, and cytokinesis occurs. In gametes 
formed in males, the cytoplasm is again divided equally 
between the two daughter cells. The end result of male meiosis 
is thus four functional daughter cells, each of which has an 
equal amount of cytoplasm. In female gametes, unequal divi¬ 
sion of the cytoplasm again occurs, forming the egg cell 
and another polar body. The polar body formed during 
meiosis I sometimes undergoes a second division, so three 
polar bodies may be present when the second stage of meiosis 
is completed. 



Meiosis is a specialized cell division process in 
which a diploid cell gives rise to haploid 
gametes. This is accomplished by combining 
two rounds of division with only one round of 
DNA replication. 


The Relationship between Meiosis and 
Gametogenesis 

The stages of meiosis can be related directly to stages in 
gametogenesis, the formation of gametes (see Fig. 2-21). In 
mature males, the seminiferous tubules of the testes are pop¬ 
ulated by spermatogonia, which are diploid cells. After going 
through several mitotic divisions, the spermatogonia produce 
primary spermatocytes. Each primary spermatocyte, which 
is also diploid, undergoes meiosis I to produce a pair of sec¬ 
ondary spermatocytes, each of which contains 23 double- 
stranded chromosomes. These undergo meiosis II, and each 
produces a pair of spermatids that contain 23 single-stranded 
chromosomes. The spermatids then lose most of their cyto¬ 
plasm and develop tails for swimming as they become mature 
sperm cells. This process, known as spermatogenesis, con¬ 
tinues throughout the life of the mature male. 



In spermatogenesis, each diploid spermatogo¬ 
nium produces four haploid sperm cells. 


Oogenesis, the process by which female gametes are 
formed, differs in several important ways from spermatogen¬ 
esis. Whereas the cycle of spermatogenesis is constantly 
recurring, much of female oogenesis is completed before 
birth. Diploid oogonia divide mitotically to produce primary 
oocytes by the third month of fetal development. More than 
6 million primary oocytes are formed during gestation, and 
they are suspended in prophase I by the time of birth. Meiosis 
continues only when a mature primary oocyte is ovulated. In 
meiosis I, the primary oocyte produces one secondary oocyte 
(containing the cytoplasm) and one polar body. The second¬ 
ary oocyte then emerges from the follicle and proceeds down 
the fallopian tube, with the polar body attached to it. Meiosis 
II begins only if the secondary oocyte is fertilized by a sperm 
cell. If this occurs, one haploid mature ovum, containing the 
cytoplasm, and another haploid polar body are produced. 
The polar bodies eventually disintegrate. About 1 hour after 
fertilization, the nuclei of the sperm cell and ovum fuse, 
forming a diploid zygote. The zygote then begins its develop¬ 
ment into an embryo through a series of mitotic divisions. 



In oogenesis, one haploid ovum and three 
haploid polar bodies are produced meiotically 
from a diploid oogonium. In contrast to sper¬ 
matogenesis, which continues throughout the 
life of the mature male, the first phase of oogen¬ 
esis is completed before the female is born; 
oogenesis is then halted until ovulation occurs. 
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I STUDY QUESTIONS 


1. Consider the following double-stranded DNA sequence: 
5'-CAG AAG AAA ATT AAC ATG TAA-3' 
3'-GTC TTC TTT TAA TTG TAG ATT-5' 

If the bottom strand serves as the template, what is the 
mRNA sequence produced by transcription of this DNA 
sequence? What is the amino acid sequence produced by 
translation of the mRNA sequence? 

2. Arrange the following terms according to their hierarchi¬ 
cal relationship to one another: genes, chromosomes, 
exons, codons, nucleotides, genome. 

3. Less than 5% of human DNA encodes proteins. Further¬ 
more, in a given cell type only 10% of the coding DNA 
actively encodes proteins. Explain these statements. 


4. What are the major differences between mitosis and 
meiosis? 

5. The human body contains approximately 10^^ cells. Start¬ 
ing with a single-cell zygote, how many mitotic cell divi¬ 
sions, on average, would be required to produce this 
number of cells? 

6. How many mature sperm cells will be produced by 100 
primary spermatocytes? How many mature egg cells will 
be produced by 100 primary oocytes? 
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CHAPTER 


Genetic Variation: Its Origin and Detection 


Humans display a substantial amount of genetic variation. 
This is reflected in traits such as height, blood pressure, and 
skin color. Included in the spectrum of genetic variation are 
disease states, such as cystic fibrosis or type 1 neurofibroma¬ 
tosis (see Chapter 4). This aspect of genetic variation is the 
focus of medical genetics. 

All genetic variation originates from the process known as 
mutation, which is defined as a change in DNA sequence. 
Mutations can affect either germline cells (cells that produce 
gametes) or somatic cells (all cells other than germline cells). 
Mutations in somatic cells can lead to cancer and are thus of 
significant concern. However, this chapter is focused primar¬ 
ily on germline mutations, because they can be transmitted 
from one generation to the next. 

As a result of mutations, a gene can differ among individu¬ 
als in terms of its DNA sequence. The differing sequences are 
referred to as alleles. A gene’s location on a chromosome is 
termed a locus (from the Latin word for “place”). For example, 
it might be said that a person has a certain allele at the 
(3-globin locus on chromosome 11. If a person has the same 
allele on both members of a chromosome pair, he or she is 
said to be a homozygote. If the alleles differ in DNA sequence, 
he or she is a heterozygote. The combination of alleles that 
is present at a given locus is termed the genotype. 

In human genetics, the term mutation has often been 
reserved for DNA sequence changes that cause genetic dis¬ 
eases and are consequently relatively rare, with a population 
frequency less than 1%. DNA sequence variants that are more 
common in populations are conventionally described as 
polymorphisms (“many forms,” describing multiple alleles 
at a locus). Loci (plural of locus) that contain multiple alleles 
are termed polymorphic. Nowadays, however, alleles that 
have a frequency less than 1% are often called polymorphisms 
as well. In addition, many common polymorphisms are now 
known to influence the risks for complex, common diseases 
such as diabetes and heart disease (see Chapter 12), so the 
traditional distinction between mutation (rare and disease- 
causing) and polymorphism (common and benign) has 
become increasingly blurred. 

One of Gregor Mendel’s important contributions to genet¬ 
ics was to show that the effects of one allele at a locus can 
mask those of another allele at the same locus. He performed 
crosses (matings) between pea plants homozygous for a “tall” 
allele (i.e., having two identical copies of an allele that we will 
label H) and plants homozygous for a “short” allele (having 
two copies of an allele labeled h). This cross, which can 


produce only heterozygous (Hh) offspring, is illustrated in 
the Punnett square shown in Figure 3-1. Mendel found that 
the offspring of these crosses, even though they were hetero¬ 
zygotes, were all tall. This is because the H allele is dominant, 
and the h allele is recessive. (It is conventional to label the 
dominant allele in upper case and the recessive allele in lower 
case.) The term recessive comes from a Latin root meaning 
“to hide.” This describes the behavior of recessive alleles well: 
in heterozygotes, the consequences of a recessive allele are 
hidden. A dominant allele exerts its effect in both the homo¬ 
zygote (HH) and the heterozygote (Hh), whereas the presence 
of the recessive allele is physically observable only when it 
occurs in homozygous form (hh). Thus, short pea plants can 
be created only by crossing parent plants that each carry at 
least one h allele. An example is a heterozygote X heterozygote 
cross, shown in Figure 3-2. 

In this chapter, we examine mutation as the source of 
genetic variation. We discuss the types of mutation, the causes 
and consequences of mutation, and the biochemical and 
molecular techniques that are now used to detect genetic 
variation in human populations. 

MUTATION: THE SOURCE OF GENETIC 
VARIATION _ 

Types of Mutation 

Some mutations consist of an alteration of the number or 
structure of chromosomes in a cell. These major chromo¬ 
some abnormalities can be observed microscopically and are 
the subject of Chapter 6. Here, the focus is on mutations that 
affect only single genes and are not microscopically observ¬ 
able. Most of our discussion centers on mutations that take 
place in coding DNA or in regulatory sequences, because 
mutations that occur in other parts of the genome usually 
have no clinical consequences. 

One important type of single-gene mutation is the base- 
pair substitution, in which one base pair is replaced by 
another.”^ This can result in a change in the amino acid 
sequence. However, because of the redundancy of the genetic 
code, many of these mutations do not change the amino acid 


*In molecular genetics, base-pair substitutions are also termed point 
mutations. However, “point mutation” was used in classical genetics 
to denote any mutation small enough to be unobservable under a 
microscope. 
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FIG 3-1 Punnett square illustrating a cross between HH and 
hh homozygote parents. 


Parent 



FIG 3-2 Punnett square illustrating a cross between two Hh 
heterozygotes. 

sequence and thus typically have no effect. Such mutations 
are called silent substitutions. Base-pair substitutions that 
alter amino acids consist of two basic types: missense muta¬ 
tions, which produce a change in a single amino acid, and 
nonsense mutations, which produce one of the three stop 
codons (UAA, UAG, or UGA) in the messenger RNA (mRNA) 
(Fig. 3-3). Because stop codons terminate translation of the 
mRNA, nonsense mutations result in a premature termina¬ 
tion of the polypeptide chain. Conversely, if a stop codon is 
altered so that it encodes an amino acid, an abnormally elon¬ 
gated polypeptide can be produced. Alterations of amino acid 
sequences can have profound consequences, and many of the 
serious genetic diseases discussed later are the result of such 
alterations. 

A second major type of mutation consists of deletions or 
insertions of one or more base pairs. These mutations, which 
can result in extra or missing amino acids in a protein, are 
often detrimental. An example of such a mutation is the 3-bp 
deletion that is found in most persons with cystic fibrosis (see 
Chapter 4). Deletions and insertions tend to be especially 


harmful when the number of missing or extra base pairs is 
not a multiple of three. Because codons consist of groups of 
three base pairs, such insertions or deletions can alter the 
downstream codons. This is a frameshift mutation (Fig. 3-4). 
For example, the insertion of a single base (an A in the second 
codon) converts a DNA sequence read as 5'-ACT GAT TGC 
GTT-3' to 5'-ACT GAA TTG CGT-3'. This changes the amino 
acid sequence from Thr-Asp-Cys-Val to Thr-Glu-Leu-Arg. 
Often, a frameshift mutation produces a stop codon down¬ 
stream of the insertion or deletion, resulting in a truncated 
polypeptide. 

On a larger scale, duplications of whole genes can also 
lead to genetic disease. A good example is given by Charcot- 
Marie-Tooth disease. This disorder, named after the three 
physicians who described it more than a century ago, is a 
peripheral nervous system condition that leads to progressive 
atrophy of the distal limb muscles. It affects approximately 1 
in 2500 persons and exists in several different forms. About 
70% of patients who have the most common form (type lA) 
display a 1.5 million-bp duplication on one copy of chromo¬ 
some 17. As a result, they have three, rather than two, copies 
of the genes in this region. One of these genes, PMP22, 
encodes a component of peripheral myelin. The increased 
dosage of the gene product contributes to the demyelination 
that is characteristic of this form of the disorder. Interestingly, 
a deletion of this same region produces a different disease, 
hereditary neuropathy with liability to pressure palsies 
(paralysis). Because either a reduction (to 50%) or an increase 
(to 150%) in the gene product produces disease, this gene is 
said to display dosage sensitivity. Base-pair substitutions in 
PMP22 itself can produce yet another disease: Dejerine- 
Sottas syndrome, which is characterized by distal muscle 
weakness, sensory alterations, muscular atrophy, and enlarged 
spinal nerve roots. 

Other types of mutation can alter the regulation of tran¬ 
scription or translation. A promoter mutation can decrease 
the affinity of RNA polymerase for a promoter site, often 
resulting in reduced production of mRNA and thus decreased 
production of a protein. Mutations of transcription factor 
genes or enhancer sequences can have similar effects. 

Mutations can also interfere with the splicing of introns 
as mature mRNA is formed from the primary mRNA tran¬ 
script. Splice-site mutations, those that occur at intron- 
exon boundaries, alter the splicing signal that is necessary 
for proper excision of an intron. Splice-site mutations can 
occur at the GT sequence that defines the 5' splice site (the 
donor site) or at the AG sequence that defines the 3' splice 
site (the acceptor site). They can also take place in the 
sequences that lie near the donor and acceptor sites. When 
such mutations occur, the excision is often made within the 
next exon, at a splice site located in the exon. These splice 
sites, whose DNA sequences differ slightly from those of 
normal splice sites, are ordinarily unused and hidden within 
the exon. They are thus termed cryptic splice sites. The use 
of a cryptic site for splicing results in partial deletion of the 
exon or, in other cases, the deletion of an entire exon. As 
Figure 3-5 shows, splice-site mutations can also result in the 
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FIG 3-3 Base-pair substitution. Missense mutations (A) produce a single amino acid change, 
whereas nonsense mutations (B) produce a stop codon in the mRNA. Stop codons terminate 
translation of the polypeptide. 
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FIG 3-4 Frameshift mutations result from the addition or 
deletion of a number of bases that is not a multiple of three. 
This alters all of the codons downstream from the site of 
insertion or deletion. 


abnormal inclusion of part or all of an intron in the mature 
mRNA. Finally, a mutation can occur at a cryptic splice site, 
causing it to appear as a normal splice site and thus to 
compete with the normal splice site. 

Several types of DNA sequences are capable of propagat¬ 
ing copies of themselves; these copies are then inserted in 
other locations on chromosomes (examples include the LINE 
and Alu repeats, discussed in Chapter 2). Such insertions can 
cause frameshift mutations. The insertion of mobile ele¬ 
ments has been shown to cause isolated cases of type 1 neu¬ 
rofibromatosis, Duchenne muscular dystrophy, (3-thalassemia, 
familial breast cancer, familial polyposis (colon cancer), and 
hemophilia A and B (clotting disorders) in humans. 

The final type of mutation to be considered here affects 
tandem repeated DNA sequences (see Chapter 2) that occur 
within or near certain disease-related genes. The repeat units 
are usually 3 bp long, so a typical example would be CAG- 
CAGCAG. A normal person has a relatively small number of 
these tandem repeats (e.g., 10 to 30 CAG consecutive ele¬ 
ments) at a specific chromosome location. Occasionally, the 
number of repeats increases during meiosis or possibly 
during early fetal development, so that a newborn might have 
hundreds or even thousands of tandem repeats. When this 
occurs in certain regions of the genome, it causes genetic 
disease. Like other mutations, these expanded repeats can be 
transmitted to the patient’s offspring. More than 20 genetic 
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FIG 3-5 A, Normal splicing. B, Splice-site mutation. The donor sequence, GT, is replaced with 
AT. This results in an incorrect splice that leaves part of the intron in the mature mRNA transcript. 
In another example of splice-site mutation (C), a second GT donor site is created within the first 
intron, resulting in a combination of abnormally and normally spliced mRNA products. 


diseases are now known to be caused by expanded repeats 
(see Chapters 4 and 5). 

Mutations are the ultimate source of genetic 
variation. Some mutations result in genetic 
disease, but most have no physical effects. The 
principal types of mutation are missense, non¬ 
sense, frameshift, promoter, and splice-site 
mutations. Mutations can also be caused by 
the random insertion of mobile elements, and 
some genetic diseases are known to be caused 
by expanded repeats. 

Molecular Consequences of Mutation 

It is useful to think of mutations in terms of their effects on 
the protein product. Broadly speaking, mutations can produce 
either a gain of function or a loss of function of the protein 
product (Fig. 3-6). Gain-of-function mutations occasionally 


result in a completely novel protein product. More com¬ 
monly, they result in overexpression of the product or inap¬ 
propriate expression (i.e., in the wrong tissue or in the wrong 
stage of development). Gain-of-function mutations produce 
dominant disorders. Charcot-Marie-Tooth disease can result 
from overexpression of the protein product, which is consid¬ 
ered a gain-of-function mutation. Huntington disease, dis¬ 
cussed in Chapter 4, is another example. 

Loss-of-function mutations are often seen in recessive 
diseases, where the mutation results in the loss of 50% of 
the protein product (e.g., a metabolic enzyme) but the 50% 
that remains is sufficient for normal function. The hetero¬ 
zygote is thus unaffected, but the homozygote, having little 
or no protein product, is affected. In some cases, however, 
50% of the gene’s protein product is not sufficient for 
normal function (haploinsufficiency), and a dominant dis¬ 
order can result. Haploinsufficiency is seen, for example, 
in the autosomal dominant disorder known as familial 
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FIG 3-6 A, Gain-of-function mutations produce a novel 
protein product or an increased amount of protein 
product. B, Loss-of-function mutations decrease the 
amount of protein product. C, Dominant negative 
mutations produce an abnormal protein product that 
interferes with the otherwise normal protein product 
of the normal allele in a heterozygote. 
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hypercholesterolemia (see Chapter 12). In this disease, a 
single copy of a mutation (heterozygosity) reduces the 
number of low-density lipoprotein (LDL) receptors by 50%. 
Cholesterol levels in heterozygotes are approximately double 
those of normal homozygotes, resulting in a substantial 
increase in the risk of heart disease. As with most disorders 
involving haploinsufficiency, the disease is more serious in 
affected homozygotes (who have few or no functional LDL 
receptors) than in heterozygotes. 

A dominant negative mutation results in a protein product 
that not only is abnormal but also inhibits the function of the 
protein produced by the normal allele in the heterozygote. 
Typically, dominant negative mutations are seen in genes that 
encode multimeric proteins (i.e., proteins composed of two 
or more subunits). Type I collagen (see Chapter 2), which is 
composed of three helical subunits, is an example of such a 
protein. An abnormal helix created by a single mutation can 
combine with the other helices, distorting them and produc¬ 
ing a seriously compromised triple-helix protein. 

Mutations can result in either a gain of func¬ 
tion or a loss of function of the protein 
product. Gain-of-function mutations are some¬ 
times seen in dominant diseases. Loss of func¬ 
tion is seen in recessive diseases and in 
diseases involving haploinsufficiency, in which 
50% of the gene product is insufficient for 
normal function. In dominant negative muta¬ 
tions, the abnormal protein product interferes 
with the normal protein product. 


Clinical Consequences of Mutation: 

The Hemoglobin Disorders 

Genetic disorders of human hemoglobin are the most 
common group of single-gene diseases: an estimated 7% of 
the world’s population carries one or more mutations of the 
genes involved in hemoglobin synthesis. Because almost all 
of the types of mutation described in this chapter have been 
observed in the hemoglobin disorders, these diseases serve as 
an important illustration of the clinical consequences of 
mutation. 

The hemoglobin molecule is a tetramer composed of four 
polypeptide chains, two labeled a and two labeled (3. The [3 
chains are encoded by a gene on chromosome 11, and the a 
chains are encoded by two genes on chromosome 16 that are 
very similar to each other. Typically, an individual has two 
normal (3 genes and four normal a genes (Fig. 3-7). Ordinar¬ 
ily, tight regulation of these genes ensures that roughly equal 
numbers of a and (3 chains are produced. Each of these 
globin chains is associated with a heme group, which con¬ 
tains an iron atom and binds with oxygen. This property 
allows hemoglobin to perform the vital function of trans¬ 
porting oxygen in erythrocytes (red blood cells). 

The hemoglobin disorders can be classified into two broad 
categories: structural abnormalities, in which the hemoglobin 
molecule is altered, and thalassemias, a group of conditions 
in which either the a- or the |3-globin chain is structurally 
normal but reduced in quantity. Another condition, heredi¬ 
tary persistence of fetal hemoglobin (HPFH), occurs when 
fetal hemoglobin, encoded by the a-globin genes and by two 
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TABLE 3-1 

Summary of the Major Hemoglobin Disorders 


DISEASE 

MUTATION TYPE 

MAJOR DISEASE FEATURES 

Sickle cell disease 

p-globin missense mutation 

Anemia, tissue infarctions, infections 

HbH disease 

Deletion or abnormality of three of the four 
a-globin genes 

Moderately severe anemia, splenomegaly 

Hydrops fetalis 

Deletion or abnormality of all four a-globin 

Severe anemia or hypoxemia, congestive 

(Hb Barts) 

genes 

heart failure; stillbirth or neonatal death 

P^-Thalassemia 

Usually nonsense, frameshift, or splice-site 
donor or acceptor mutations; no p-globin 
produced 

Severe anemia, splenomegaly, skeletal 
abnormalities, infections; often fatal during 
first decade if untreated 

P^-Thalassemia 

Usually missense, regulatory, or splice-site 
consensus sequence or cryptic splice-site 
mutations; small amount of p-globln produced 

Features similar to those of p“-thalassemia 
but often somewhat milder 
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FIG 3-7 The a-globin gene cluster on chromosome 16 and 
the (3-globin gene cluster on chromosome 11. The p-globin 
cluster includes the e-globin gene, which encodes embryonic 
globin, and the y-globin genes, which encode fetal globin. The 
v|/P gene is not expressed. The a-globin cluster includes the 
^-globin gene, which encodes embryonic a-globin. 


P-globin-like genes called ^ and (see Fig. 3-7), continues 
to be produced after birth (normally, y-chain production 
ceases and P-chain production begins at the time of birth). 
HPFH does not cause disease but instead can compensate for 
a lack of normal adult hemoglobin. 

A large array of different hemoglobin disorders have been 
identified. The discussion that follows is a greatly simplified 
presentation of the major forms of these disorders. The 
hemoglobin disorders, the mutations that cause them, and 
their major features are summarized in Table 3-1. 

Sickle Cell Disease 

Sickle cell disease, which results from an abnormality of 
hemoglobin structure, is seen in approximately 1 in 400 to 1 
in 600 African American births. It is even more common in 
parts of Africa, where it can affect up to 1 in 50 births, and it 
is also seen occasionally in Mediterranean and Middle Eastern 
populations. Sickle cell disease is typically caused by a single 
missense mutation that effects a substitution of valine for 
glutamic acid at position 6 of the P-globin polypeptide chain 
(Fig. 3-8). In homozygotes, this amino acid substitution alters 
the structure of hemoglobin molecules such that they form 


aggregates, causing erythrocytes to assume a characteristic 
sickle shape under conditions of low oxygen tension (see Fig. 
3-8, A). These conditions are experienced in capillaries, the 
tiny vessels whose diameter is smaller than that of the eryth¬ 
rocyte. Normal erythrocytes (see Fig. 3-8, B) can squeeze 
through capillaries, but sickled erythrocytes are less flexible 
and are unable to do so. In addition, the abnormal erythro¬ 
cytes tend to stick to the vascular endothelium (the inner¬ 
most lining of blood vessels). 

The resultant vascular obstruction produces localized 
hypoxemia (lack of oxygen), painful sickling crises, and 
infarctions of various tissues, including bone, spleen, kidneys, 
brain, and lungs (an infarction is tissue death due to hypox¬ 
emia). Premature destruction of the sickled erythrocytes 
decreases the number of circulating erythrocytes and the 
hemoglobin level, producing anemia. The spleen becomes 
enlarged (splenomegaly), but infarctions eventually destroy 
this organ, producing some loss of immune function. This 
contributes to the recurrent and sometimes fatal bacterial 
infections (especially pneumonia) that are commonly seen in 
persons with sickle cell disease. About 10% of persons with 
sickle cell disease experience a stroke before age 20 years. In 
North America, it is estimated that the life expectancy of 
persons with sickle cell disease is reduced by about 30 years. 



Sickle cell disease, which causes anemia, 
tissue infarctions, and multiple infections, is 
the result of a single missense mutation that 
produces an amino acid substitution in the 
P-globin chain. 


Thalassemia 

The term thalassemia is derived from the Greek word thalassa 
(“sea”); thalassemia was first described in populations living 
near the Mediterranean Sea, although it is also common in 
portions of Africa, the Mideast, India, and Southeast Asia. In 
contrast to sickle cell disease, in which a mutation alters the 
structure of the hemoglobin molecule, the mutations that 
cause thalassemia reduce the quantity of either a globin or P 
globin. Thalassemia can be divided into two major groups, 
a-thalassemia and P-thalassemia, depending on the globin 
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FIG 3-8 A, Erythrocytes from patients with sickle cell disease assume a characteristic shape 
under conditions of low oxygen tension. B, Compare with normal erythrocytes. 


chain that is reduced in quantity. When one type of chain is 
decreased in number, the other chain type, unable to partici¬ 
pate in normal tetramer formation, tends to form molecules 
consisting of four chains of the excess type only. These are 
termed homotetramers, in contrast to the heterotetramers 
normally formed by a and (3 chains. In a-thalassemia, the 
a-globin chains are deficient, so the (3 chains (or y chains in 
the fetus) are found in excess. They form homotetramers that 
have a greatly reduced oxygen-binding capacity, producing 
hypoxemia. In (3-thaIassemia, the excess a chains form homo¬ 
tetramers that precipitate and damage the cell membranes of 
red blood cell precursors (i.e., the cells that form erythro¬ 
cytes). This leads to premature erythrocyte destruction and 
anemia. 

Most cases of a-thalassemia are caused by deletions of the 
a-globin genes. The loss of one or two of these genes has no 
clinical effect. The loss or abnormality of three of the a genes 
produces moderately severe anemia and splenomegaly (HbH 
disease). Loss of all four a genes, a condition seen primarily 
among Southeast Asians, produces hypoxemia in the fetus 
and hydrops fetalis (a condition in which there is a massive 
buildup of fluid). Severe hydrops fetalis often causes stillbirth 
or neonatal death. 

The a-thalassemia conditions are usually 
caused by deletions of a-globin genes. The 
loss of three of these genes leads to 


moderately severe anemia, and the loss of all 
four is fatal. 

Persons with a [3-globin mutation in one copy of chromo¬ 
some 11 (heterozygotes) are said to have (3-thalassemia minor, 
a condition that involves little or no anemia and does not 
ordinarily require clinical management. Those in whom both 
copies of the chromosome carry a [3-globin mutation develop 
either |3-thalassemia major (also called Cooley’s anemia) or 
a less-serious condition, (3-thalassemia intermedia. [3-Globin 
may be completely absent ((3“-thalassemia), or it may be 
reduced to about 10% to 30% of the normal amount ((3^- 
thalassemia). Typically, (3°-thalassemia produces a more severe 
disease phenotype, but because disease features are caused by 
an excess of a-globin chains, patients with [3°-thalassemia are 
less severely affected when they also have a-globin mutations 
that reduce the quantity of a-globin chains. 

[3-globin is not produced until after birth, so the effects of 
[3-thalassemia major are not seen clinically until the age of 2 
to 6 months. These patients develop severe anemia. If the 
condition is left untreated, substantial growth retardation can 
occur. The anemia causes bone marrow expansion, which in 
turn produces skeletal changes, including a protuberant 
upper jaw and cheekbones and thinning of the long bones 
(making them susceptible to fracture). Splenomegaly (Fig. 
3-9) and infections are common, and patients with untreated 
[3-thalassemia major often die during the first decade of life. 
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FIG 3-9 A child with p-thalassemia major who has severe 
splenomegaly. 

P-thalassemia can vary considerably in severity, depending 
on the precise nature of the responsible mutation. 

In contrast to a-thalassemia, gene deletions are relatively 
rare in P-thalassemia. Instead, most cases are caused by single¬ 
base mutations. Nonsense mutations, which result in prema¬ 
ture termination of translation of the P-globin chain, usually 
produce P“-thalassemia. Frameshift mutations also typically 
produce the P" form. In addition to mutations in the P-globin 
gene itself, alterations in regulatory sequences can occur. 
P-globin transcription is regulated by a promoter, two enhanc¬ 
ers, and an upstream region known as the locus control region 
(LCR) (see Fig. 3-7). Mutations in these regulatory regions 
usually result in reduced synthesis of mRNA and a reduction, 
but not complete absence, of P-globin (P^-thalassemia). 
Several types of splice-site mutations have also been observed. 
If a point mutation occurs at a donor or acceptor site, normal 
splicing is destroyed completely, producing P“-thalassemia. 
Mutations in the surrounding consensus sequences usually 
produce P‘*'-thalassemia. Mutations also occur in the cryptic 
splice sites found in introns or exons of the P-globin gene, 
which causes these sites to be available to the splicing mecha¬ 
nism. These additional splice sites then compete with the 
normal splice sites, producing some normal and some abnor¬ 
mal P-globin chains. The result is usually P^-thalassemia. 

Many different types of mutations can pro¬ 
duce p-thalassemia. Nonsense, frameshift, and 
splice-site donor and acceptor mutations tend 
to produce more severe disease. Regulatory 
mutations and those involving splice-site con¬ 
sensus sequences and cryptic splice sites tend 
to produce less severe disease. 


Hundreds of different P-globin mutations have been 
reported. Consequently, most patients with P-thalassemia are 
not homozygotes in the strict sense: they usually have a dif¬ 
ferent P-globin mutation on each copy of chromosome 11 
and are termed compound heterozygotes (Fig. 3-10). Even 
though the mutations differ, each of the two P-globin genes 
is altered, producing a disease state. It is common to apply 
the term homozygote loosely to compound heterozygotes. 

Patients with sickle cell disease or P-thalassemia major 
are sometimes treated with blood transfusions and with che¬ 
lating agents that remove excess iron introduced by the 
transfusions. Prophylactic administration of antibiotics and 
antipneumococcal vaccine help to prevent bacterial infec¬ 
tions in patients with sickle cell disease, and analgesics are 
administered for pain relief during sickling crises. Bone 
marrow transplantation, which provides donor stem cells 
that produce genetically normal erythrocytes, has been per¬ 
formed on patients with severe P-thalassemia or sickle cell 
disease. However, it is often impossible to find a suitably 
matched donor, and the mortality rate from this procedure 
is still fairly high (approximately 5% to 30%, depending on 
the severity of disease and the age of the patient). A lack of 
normal adult P-globin can be compensated for by reactivat¬ 
ing the genes that encode fetal P-globin (the y-globin genes, 
discussed previously). Agents such as hydroxycarbamide and 
butyrate can reactivate these genes and are being investi¬ 
gated. Also, P-thalassemia is a strong candidate for gene 
therapy (see Chapter 13). 

Causes of Mutation 

A large number of agents are known to cause induced muta¬ 
tions. These mutations, which are attributed to known envi¬ 
ronmental causes, can be contrasted with spontaneous 
mutations, which arise naturally in cells, for example during 
DNA replication. Agents that cause induced mutations are 
known collectively as mutagens. Animal studies have shown 
that radiation is an important class of mutagen (Clinical 
Commentary 3-1). Ionizing radiation, such as that produced 
by x-rays and nuclear fallout, can eject electrons from atoms, 
forming electrically charged ions. When these ions are situ¬ 
ated within or near the DNA molecule, they can promote 
chemical reactions that change DNA bases. Ionizing radiation 
can also break the bonds of double-stranded DNA. This form 
of radiation can reach all cells of the body, including the 
germline cells. 

Nonionizing radiation does not form charged ions but 
can move electrons from inner to outer orbits within an 
atom. The atom becomes chemically unstable. Ultraviolet 
(UV) radiation, which occurs naturally in sunlight, is an 
example of nonionizing radiation. UV radiation causes the 
formation of covalent bonds between adjacent pyrimidine 
bases (i.e., cytosine or thymine). These pyrimidine dimers (a 
dimer is a molecule having two subunits) are unable to pair 
properly with purines during DNA replication; this results in 
a base-pair substitution (Fig. 3-11). Because UV radiation is 
absorbed by the skin, it does not reach the germline but can 
cause skin cancer (Clinical Commentary 3-2). 
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FIG 3-10 A, True homozygotes have two alleles that are identical in DNA sequence. Here, the 
homozygote has two copies of a single-base mutation, shown by the asterisk in the same posi¬ 
tion in the DNA sequence. Both mutations (alleles 1 and 2) have a loss-of-function effect, giving 
rise to a recessive disease. B, The same effect is seen in a compound heterozygote, which has 
two different mutations (asterisks) in two different locations in the gene's DNA sequence. Each 
allele has a loss-of-function effect, again causing a recessive disease. 



FIG 3-11 A, Pyrimidine dimers originate when covalent bonds form between adjacent pyrimidine 
(cytosine or thymine) bases. This deforms the DNA, interfering with normal base pairing. B, The 
defect is repaired by removal and replacement of the dimer and bases on either side of it, with 
the complementary DNA strand used as a template. 
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CLINICAL COMMENTARY 3-1 

The Effects of Radiation on Mutation Rates 

Because mutation is a rare event, it is difficult to measure 
directly in humans. The relationship between radiation expo¬ 
sure and mutation is similarly difficult to assess. For a person 
living in a developed country, a typical lifetime exposure to 
ionizing radiation is about 6 to 7 rem.* About one third to one 
half of this amount is thought to originate from medical and 
dental x-ray procedures. 

Unfortunately, a few human populations have received much 
larger radiation doses. The most thoroughly studied such popu¬ 
lation consists of the survivors of the atomic bomb blasts that 
occurred in Hiroshima and Nagasaki, Japan, at the close of 
World War II. Many of those who were exposed to high doses 
of radiation died from radiation sickness. Others survived, and 
many of the survivors produced offspring. 

To study the effects of radiation exposure in this population, 
a large team of Japanese and American scientists conducted 
medical and genetic investigations of some of the survivors. A 
significant number developed cancers and chromosome abnor¬ 
malities in their somatic cells, probably as a consequence of 
radiation exposure. To assess the effects of radiation exposure 
on the subjects' germlines, the scientists compared the off¬ 
spring of those who suffered substantial radiation exposure 
with the offspring of those who did not. Although it is difficult 
to establish radiation doses with precision, there is no doubt 
that, in general, those who were situated closer to the blasts 
suffered much higher exposure levels. It is estimated that the 
exposed group received roughly 30 to 60 rem of radiation, 
many times the average lifetime radiation exposure. 

In a sample of more than 76,000 offspring of these survivors, 
researchers assessed a large number of factors, including 
stillbirths, chromosome abnormalities, birth defects, cancer 
before 20 years of age, death before 26 years of age, and 
various measures of growth and development (e.g., intelli¬ 
gence quotient). There were no statistically significant differ¬ 
ences between the offspring of persons who were exposed to 
radiation and the offspring of those who were not exposed. In 
addition, direct genetic studies of mutations have been carried 
out using minisatellite polymorphisms and protein electropho¬ 
resis, a technique that detects mutations that lead to amino 


acid changes (discussed elsewhere in this chapter). Parents 
and offspring were compared to determine whether germline 
mutations had occurred at various loci. The numbers of muta¬ 
tions detected in the exposed and unexposed groups were 
statistically equivalent. 

More recently, studies of those who were exposed to radia¬ 
tion from the Chernobyl nuclear power plant accident have 
demonstrated a significant increase in thyroid cancers among 
children exposed to radiation. This reflects the effects of 
somatic mutations. The evidence for increased frequencies of 
germline mutations in protein-coding DNA, however, remains 
unclear. A number of other studies of the effects of radiation 
on humans have been reported, including investigations of 
those who live near nuclear power plants. The radiation doses 
received by these persons are substantially smaller than those 
of the populations discussed previously, and the results of 
these studies are equivocal. 

It is remarkable that even though there was substantial evi¬ 
dence for radiation effects on somatic ceils in the Hiroshima 
and Nagasaki studies, no detectable effect could be seen for 
germline cells. What could account for this? Because large 
doses of radiation are lethal, many of those who would have 
been most strongly affected would not be included in these 
studies. Furthermore, because germline mutation rates are 
very small, even relatively large samples of radiation-exposed 
persons may be insufficient to detect increases in mutation 
rates. It is also possible that DNA repair compensated for some 
radiation-induced germline damage. 

These results argue that radiation exposure, which is clearly 
associated with somatic mutations, should not be taken lightly. 
Above-ground nuclear testing in the American Southwest has 
produced increased rates of leukemia and thyroid cancer in a 
segment of the population. Radon, a radioactive gas that is 
produced by the decay of naturally occurring uranium, can be 
found at dangerously high levels in some homes and poses a 
risk for lung cancer. Any unnecessary exposure to radiation, 
particularly to the gonads or to developing fetuses, should be 
avoided. 


*A rem is a standard unit for measuring radiation exposure. It is roughly equal to 0.01 joule of absorbed energy per kilogram of tissue. 


CLINICAL COMMENTARY 3-2 

Xeroderma Pigmentosum: A Disease of Faulty DNA Repair 

An inevitable consequence of exposure to UV radiation is the 
formation of potentially dangerous pyrimidine dimers in the 
DNA of skin cells. Fortunately, the highly efficient nucleotide 
excision repair (NER) system removes these dimers in normal 
persons. Among those affected with the rare autosomal reces¬ 
sive disease xeroderma pigmentosum (XP), this system does 
not work properly, and the resulting DNA replication errors lead 
to base-pair substitutions in skin cells. XP varies substantially 
in severity, but early symptoms are usually seen in the first 1 


to 2 years of life. Patients develop dry, scaly skin (xeroderma) 
along with extensive freckling and abnormal skin pigmentation 
(pigmentosum). Skin tumors, which can be numerous, typically 
appear by 10 years of age. It is estimated that the risk of skin 
tumors in persons with XP is elevated approximately 1000-fold. 
These cancers are concentrated primarily in sun-exposed parts 
of the body. Patients are advised to avoid sources of UV light 
(e.g., sunlight), and cancerous growths are removed surgically. 
Neurological abnormalities are seen in about 30% of persons 


Continued 
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CLINICAL COMMENTARY 3-2 

Xeroderma Pigmentosum: A Disease of Faulty DNA Repair—coat'd 

with XP. Severe, potentially lethal malignancies can occur 
before 20 years of age. 

The NER system is encoded by at least 28 different genes, 
and inherited mutations in any of seven of these genes can 
give rise to XP. These genes encode helicases that unwind 
the double-stranded DNA helix; an endonuclease that cuts the 
DNA at the site of the dimer; an exonuclease that removes the 
dimer and nearby nucleotides; a polymerase that fills the gap 
with DNA bases (using the complementary DNA strand as a 
template); and a ligase that rejoins the corrected portion of 
DNA to the original strand. 

It should be emphasized that the expression of XP requires 
inherited germline mutations of an NER gene as well as sub¬ 
sequent uncorrected somatic mutations of genes in skin cells. 
Some of these somatic mutations can affect genes that 
promote cancer (see Chapter 11 ), resulting in tumor formation. 
The skin-cell mutations themselves are somatic and thus are 
not transmitted to future generations. 

NER is but one type of DNA repair. The table below provides 
examples of a number of other diseases that result from 
defects in various types of DNA repair mechanisms (Fig. 3-12) 
(Table 3-2). 



FIG 3-12 Xeroderma pigmentosum. This patient's skin has 
multiple hyperpigmented lesions, and skin tumors on the 
forehead have been marked for excision. 


TABLE 3-2 Examples of Diseases That Are Caused by a Defect in DNA Repair 

DISEASE 

FEATURES 

TYPE OF REPAIR DEFECT 

Xeroderma pigmentosum 

Skin tumors, photosensitivity, cataracts, 
neurological abnormalities 

Nucleotide excision repair defects, including 
mutations in helicase and endonuclease 

genes 

Cockayne syndrome 

Reduced stature, skeletal abnormalities, optic 
atrophy, deafness, photosensitivity, mental 
retardation 

Defective repair of UV-induced damage in 
transcriptionally active DNA; considerable 
etiological and symptomatic overlap with 
xeroderma pigmentosum and 
trichothiodystrophy 

Fanconi anemia 

Anemia; leukemia susceptibility; limb, kidney, 
and heart malformations; chromosome 
instability 

As many as eight different genes may be 
involved, but their exact role in DNA repair 
is not yet known 

Bloom syndrome 

Growth deficiency, immunodeficiency, 
chromosome instability, increased cancer 
incidence 

Mutations in the reqQ helicase family 

Werner syndrome 

Cataracts, osteoporosis, atherosclerosis, loss 
of skin elasticity, short stature, diabetes, 
increased cancer incidence; sometimes 
described as "premature aging" 

Mutations in the reqQ helicase family 

Ataxia-telangiectasia 

Cerebellar ataxia, telangiectases,* immune 
deficiency, increased cancer incidence, 
chromosome instability 

Normal gene product is likely to be involved 
in halting the cell cycle after DNA damage 

occurs 

Flereditary nonpolyposis 

Proximal bowel tumors, increased 

Mutations in any of six DNA mismatch-repair 

colorectal cancer 

susceptibility to several other types of cancer 

genes 

*Telangiectases are vascular lesions caused by the dilatation of small blood vessels. 

This typically produces discoloration of the skin. 
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A variety of chemicals can also induce mutations, some¬ 
times because of their chemical similarity to DNA bases. 
Because of this similarity, these base analogs, such as 
5-bromouracil, can be substituted for a true DNA base during 
replication. The analog is not exactly the same as the base it 
replaces, so it can cause pairing errors during subsequent 
replications. Other chemical mutagens, such as acridine dyes, 
can physically insert themselves between existing bases, dis¬ 
torting the DNA helix and causing frameshift mutations. Still 
other mutagens can directly alter DNA bases, causing replica¬ 
tion errors. An example of the latter is nitrous acid, which 
removes an amino group from cytosine, converting it to 
uracil. Although uracil is normally found in RNA, it mimics 
the pairing action of thymine in DNA. Thus, it pairs with 
adenine instead of guanine, as the original cytosine would 
have done. The end result is a base-pair substitution. 

Hundreds of chemicals are now known to be mutagenic 
in laboratory animals. Among these are nitrogen mustard, 
vinyl chloride, alkylating agents, formaldehyde, sodium 
nitrite, and saccharin. Some of these chemicals are much 
more potent mutagens than others. Nitrogen mustard, for 
example, is a powerful mutagen, whereas saccharin is a rela¬ 
tively weak one. Although some mutagenic chemicals are 
produced by humans, many occur naturally in the environ¬ 
ment (e.g., aflatoxin Bi, a common contaminant of foods). 

Many substances in the environment are 
known to be mutagenic, including ionizing and 
nonionizing radiation and hundreds of differ¬ 
ent chemicals. These mutagens are capable of 
causing base substitutions, deletions, and 
frameshifts. Ionizing radiation can induce 
double-stranded DNA breaks. Some mutagens 
occur naturally, and others are generated by 
humans. 

DNA Repair 

Considering that 3 billion DNA base pairs must be repli¬ 
cated in each cell division, and considering the large number 
of mutagens to which we are exposed, DNA replication is 
surprisingly accurate. A primary reason for this accuracy is 
the process of DNA repair, which takes place in all normal 
cells of higher organisms. Several dozen enzymes are 
involved in the repair of damaged DNA. They collectively 
recognize an altered base, excise it by cutting the DNA 
strand, replace it with the correct base (determined from the 
complementary strand), and reseal the DNA. These repair 
mechanisms are estimated to correct at least 99.9% of initial 
errors. 

Because DNA repair is essential for the accurate replica¬ 
tion of DNA, defects in DNA repair systems can lead to many 
types of disease. For example, inherited mutations in genes 
responsible for DNA mismatch repair result in the persis¬ 
tence of cells with replication errors (i.e., mismatches) and 
can lead to some types of cancer (see Chapter 11). A dimin¬ 
ished capacity to repair douhle-stranded DNA breaks can 
lead to ovarian and/or breast cancer. Nucleotide excision 


repair is necessary for the removal of larger changes in the 
DNA helix (e.g., pyrimidine dimers); defects in excision 
repair lead to a number of diseases, of which xeroderma 
pigmentosum is an important example (see Clinical Com¬ 
mentary 3-2). 

DNA repair helps to ensure the accuracy of the 
DNA sequence by correcting replication errors 
(mismatches), repairing double-stranded DNA 
breaks, and excising damaged nucleotides. 

Mutation Rates 

How often do spontaneous mutations occur? At the nucleo¬ 
tide level, the mutation rate is estimated to be about 10“* per 
base pair per generation (this figure represents mutations that 
have escaped the process of DNA repair). Thus, each gamete 
contains approximately 30 new mutations, the great majority 
of which occur in noncoding DNA. At the level of the gene, 
the mutation rate is quite variable, ranging from 10“‘‘ to 10“^ 
per locus per cell division. There are at least two reasons for 
this large range of variation: the size of the gene and the 
susceptibility of certain nucleotide sequences. 

First, genes vary tremendously in size. The somatostatin 
gene, for example, is quite small, containing 1480 bp. In con¬ 
trast, the gene responsible for Duchenne muscular dystrophy 
(DMD) spans more than 2 million bp. As might be expected, 
larger genes present larger targets for mutation and usually 
experience mutation more often than do smaller genes. The 
DMD gene, as well as the genes responsible for hemophilia A 
and type 1 neurofibromatosis, are all very large and have high 
mutation rates. 

Second, it is well established that certain nucleotide 
sequences are especially susceptible to mutation. These are 
termed mutation hot spots. The best-known example is the 
two-base (dinucleotide) sequence CG. In mammals, about 
80% of CG dinucleotides are methylated: a methyl group is 
attached to the cytosine base (these dinucleotide sequences 
are also labeled CpG [cytosine-phosphate-guanine], to dis¬ 
tinguish the two-base DNA sequence from a single pair of 
complementary bases, C and G). A methylated cytosine, 
5-methylcytosine, easily loses an amino group, converting it 
to thymine. The end result is a mutation from cytosine to 
thymine (Fig. 3-13). Surveys of mutations in human genetic 
diseases have shown that the mutation rate at CG dinucleo¬ 
tides is about 12 times higher than at other dinucleotide 
sequences. Mutation hot spots, in the form of CG dinucleo¬ 
tides, have been identified in a number of important human 
disease genes, including the procollagen genes responsible for 
osteogenesis imperfecta (see Chapter 2). Other disease exam¬ 
ples are discussed in Chapters 4 and 5. 

Parental age is strongly correlated with the probability of 
transmitting a mutation to one’s offspring. Some chromo¬ 
some abnormalities increase dramatically with maternal 
age (see Chapter 6), and single-gene mutations increase 
with paternal age. The latter is seen in several single-gene 
disorders, including Marfan syndrome and achondroplasia. 
As Figure 3-14 shows, the risk of producing a child with 
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Cytosine 5-methylcytosine Thymine 

FIG 3-13 Cytosine methylation. The addition of a methyl group (CH 3 ) to a cytosine base forms 
5-methylcytosine. The subsequent loss of an amino group (deamination) forms thymine. The 
result is a cytosine ^ thymine substitution. 



Age 

FIG 3-14 Paternal age effect. For some single-gene disor¬ 
ders, the risk of producing a child with the condition (y-axis) 
increases with the father's age (x-axis). 


Marfan syndrome is several times higher for a father older 
than 40 years than for a father in his 20s. This paternal age 
effect is usually attributed to the fact that the stem cells giving 
rise to sperm cells continue to divide throughout life, which 
allows a progressive buildup of DNA replication errors. 
Recent comparisons of whole genome sequences in parents 
and offspring estimate that approximately two additional 
mutations are transmitted with each additional year of pater¬ 
nal age. 

Large genes, because of their size, are gener¬ 
ally more likely to experience mutations than 
are small genes. Mutation hot spots, particu¬ 
larly methylated CG dinucleotides, experience 
elevated mutation rates. For many single-gene 
disorders, there is a substantial increase in 
mutation risk with advanced paternal age. 


DETECTION AND MEASUREMENT OF 
GENETIC VARIATION 

For centuries, humans have been intrigued by the differences 
that can be seen among individuals. Attention was long 
focused on observable differences such as skin color or body 
shape and size. Only in the 20th century did it become 
possible to examine variation in genes, the consequence of 
mutations accumulated through time. The evaluation and 
measurement of this variation in populations and families 
are important for mapping genes to specific locations on 
chromosomes, a key step in determining gene function (see 
Chapter 8). The evaluation of genetic variation also provides 
the basis for genetic diagnosis, and it is highly useful in foren¬ 
sics. In this section, several key approaches to detecting genetic 
variation in humans are discussed in historical sequence. 

Blood Groups 

Several dozen blood group systems have been defined on the 
basis of antigens located on the surfaces of erythrocytes. 
Some are involved in determining whether a person can 
receive a blood transfusion from a specific donor. Because 
individuals differ extensively in terms of blood groups, these 
systems provided an important early means of assessing 
genetic variation. 

Each of the blood group systems is determined by a dif¬ 
ferent gene or set of genes. The various antigens that can be 
expressed within a system are the result of different DNA 
sequences in these genes. Two blood-group systems that 
have special medical significance—the ABO and Rh blood 
groups—are discussed here. The ABO and Rh systems are 
both important in determining the compatibility of blood 
transfusions and tissue grafts. Some combinations of these 
blood groups can produce maternal-fetal incompatibility, 
sometimes with serious results for the fetus. These issues are 
discussed in detail in Chapter 9. 

The ABO Blood Group 

Human blood transfusions were carried out as early as 1818, 
but they were often unsuccessful. After transfusion, some 
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recipients suffered a massive, sometimes fatal, hemolytic 
reaction. In 1900 Karl Landsteiner discovered that this reac¬ 
tion was caused by the ABO antigens located on erythrocyte 
surfaces. The ABO system consists of two major antigens, 
labeled A and B. A person can have one of four major blood 
types: people with blood type A carry the A antigen on their 
erythrocytes, those with type B carry the B antigen, those 
with type AB carry both A and B, and those with type O carry 
neither antigen. Each individual has antibodies that react 
against any antigens that are not found on their own red 
blood cell surfaces. For example, a person with type A blood 
has anti-B antibodies, and transfusing type B blood into this 
person provokes a severe antibody reaction. It is straightfor¬ 
ward to determine ABO blood type in the laboratory by 
mixing a small sample of a person’s blood with solutions 
containing different antibodies and observing which combi¬ 
nations cause the observable clumping that is characteristic 
of an antibody-antigen interaction. 

The ABO system, which is encoded by a single gene on 
chromosome 9, consists of three primary alleles, labeled l\ 

and 7°. (There are also subtypes of both the and f 
alleles, but they are not addressed here.) Persons with the 7^ 
allele have the A antigen on their erythrocyte surfaces (blood 
type A), and those with f have the B antigen on their 
cell surfaces (blood type B). Those with both alleles express 
both antigens (blood type AB), and those with two copies 
of the 7° allele have neither antigen (type O blood). Because 
the 7° allele produces no antigen, persons who are f or f 
f heterozygotes have blood types A and B, respectively 
(Table 3-3). 

Because populations vary substantially in terms of the 
frequency with which the ABO alleles occur, the ABO locus 
was the first blood group system to be used extensively in 
studies of genetic variation among individuals and popula¬ 
tions. For example, early studies showed that the A antigen is 
relatively common in western European populations, and 
the B antigen is especially common among Asians. Neither 
antigen is common among native South American popula¬ 
tions, the great majority of whom have blood type O. 

The Rh System 

Like the ABO system, the Rh system is defined on the basis 
of antigens that are present on erythrocyte surfaces. This 


TABLE 3-3 Relationship between ABO 
Genotype and Blood Type 


GENOTYPE 

BLOOD TYPE 

ANTIBODIES PRESENT 

fAfA 

A 

Anti-B 

jAp 

A 

Anti-B 

/S/S 

B 

Anti-A 

/S/0 

B 

Anti-A 

/A/S 

AB 

None 

PP 

0 

Anti-A and anti-B 


system is named after the rhesus monkey, the experimental 
animal in which it was first isolated by Landsteiner in the 
late 1930s. It is typed in the laboratory by a procedure 
similar to the one described for the ABO system. Rh alleles 
vary considerably among individuals and populations and 
thus have been another highly useful tool for assessing 
genetic variation. The molecular basis of variation in both 
the ABO and the Rh systems has been elucidated (for 
further details, see the suggested readings at the end of this 
chapter), and it is becoming increasingly common to type 
these systems by directly examining an individual’s DNA 
sequence rather than by assessing an antibody-antigen 
reaction. 

The blood groups, of which the ABO and Rh 
systems are examples, have provided an 
important means of studying human genetic 
variation. Blood group variation is the result 
of antigens that occur on the surface of 
erythrocytes. 

Protein Electrophoresis 

Protein electrophoresis, developed first in the 1930s and 
applied widely to humans in the 1950s and 1960s, increased 
the number of detectable polymorphic loci considerably. This 
technique makes use of the fact that a single amino acid dif¬ 
ference in a protein (the result of a mutation in the corre¬ 
sponding DNA sequence) can cause a slight difference in the 
electrical charge of the protein. 

An example is the common sickle cell disease mutation 
discussed earlier. The replacement of glutamic acid with 
valine in the jJ-globin chain produces a difference in electri¬ 
cal charge because glutamic acid has two carboxyl groups, 
whereas valine has only one carboxyl group. Electrophoresis 
can be used to determine whether a person has normal 
hemoglobin (HbA) or the mutation that causes sickle cell 
disease (HbS) (Fig. 3-15). The hemoglobin is placed in 
an electrically charged gel composed of starch, agarose, or 
polyacrylamide (see Fig. 3-15, A). The slight difference in 
charge resulting from the amino acid difference causes the 
HbA and HbS forms to migrate at different rates through 
the gel. The protein molecules are allowed to migrate for 
several hours and are then stained with chemical solutions 
so that their positions can be seen (see Fig. 3-15, 77). From 
the resulting pattern it can be determined whether the 
person is an HbA homozygote, an HbS homozygote, or a 
heterozygote having HbA on one chromosome copy and 
HbS on the other. 

Protein electrophoresis has been used to detect amino 
acid variation in hundreds of human proteins. However, 
silent substitutions, which do not alter amino acids, cannot 
be detected by this approach. In addition, some amino 
acid substitutions do not alter the electrical charge of the 
protein molecule. For these reasons, protein electropho¬ 
resis detects only about one third of the mutations that 
occur in coding DNA. In addition, single-base substitutions 
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Tissue extract 
in sample slot 




Stained bands show 
position of enzyme 



substrate and salt 


HbS HbS sickle cell 
disease 

HbA HbA normal 

HbA HbS 
heterozygote 

FIG 3-15 The process of protein electrophoresis. A, A tissue 
sample is loaded in the slot at the top of the gel, and an 
electrical current is run though the gel. After staining, distinct 
bands, representing molecules with different electrical 
charges and therefore different amino acid sequences, are 
visible. B, HbA homozygotes show a single band closer to 
the positive pole, whereas HbS homozygotes show a single 
band closer to the negative pole. Heterozygotes, having both 
alleles, show two bands. 



in noncoding DNA are not usually detected by protein 
electrophoresis. 



Protein electrophoresis detects variations in 
genes that encode certain serum proteins. 
These variations are observable because pro¬ 
teins with slight differences in their amino acid 
sequence migrate at different rates through 
electrically charged gels. 


Detecting Variation at the DNA Level 

Each human haploid DNA sequence (i.e., the sequence 
inherited from one parent) differs from any other human 
haploid sequence by at least three to four million DNA base 
pairs, which amounts to one single-base difference every 
1000 base pairs. Because there are only a few hundred or so 
blood group and protein electrophoretic polymorphisms, 
these approaches have detected only a tiny fraction of human 
DNA variation. Yet the assessment of this variation is critical 
to gene identification and genetic diagnosis (see Chapters 8 
and 13). Fortunately, molecular techniques developed since 
the 1980s have enabled the detection of millions of new poly¬ 
morphisms at the DNA level. These techniques, which have 
revolutionized both the practice and the potential of medical 
genetics, are discussed next. 


Southern Blotting and Restriction Fragment Analysis 

An early approach to the detection of genetic variation at the 
DNA level took advantage of the existence of bacterial 
enzymes known as restriction endonucleases, or restriction 
enzymes. These enzymes cleave human DNA at specific 
sequences, termed restriction sites. For example, the intesti¬ 
nal bacterium Escherichia coli produces a restriction enzyme, 
called EcoRI, that recognizes the DNA sequence GAATTC. 
Each time this sequence is encountered, the enzyme cleaves 
the sequence between the G and the A (Fig. 3-16). A restric¬ 
tion digest of human DNA using this enzyme will produce 
more than 1 million DNA fragments (restriction fragments). 
These fragments are then subjected to gel electrophoresis, in 
which the smaller ones migrate more quickly through the gel 
than do the larger ones (Fig. 3-17). The DNA is denatured 
(i.e., converted from a double-stranded to a single-stranded 
form) by exposing it to alkaline chemical solutions. To fix 
their positions permanently, the DNA fragments are trans¬ 
ferred from the gel to a solid membrane, such as nitrocellu¬ 
lose (this is a Southern transfer, after the man who invented 
the process in the mid-1970s). At this point, the solid mem¬ 
brane, often called a Southern blot, contains many thousands 
of fragments arrayed according to their size. Because of their 
large number, the fragments are indistinguishable from one 
another. 

To visualize only the fragments corresponding to a specific 
region of DNA, a probe, consisting of a small piece of single- 
stranded human DNA (a few kilobases [kb] in length), is 
constructed using recombinant DNA techniques (Box 3-1) 
(Fig. 3-18). The probe is labeled, often with a radioactive 
isotope, and then exposed to the Southern blot. The probe 
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FIG 3-16 Cleavage of DNA by the EcoRI restriction enzyme. In B, the enzyme cleaves the three 
GAATTC recognition sequences, producing two smaller fragments. In A, the middle sequence 
is GAATTT instead of GAATTC, so it cannot be cleaved by the enzyme. The result is a single, 
longer fragment. 
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FIG 3-17 Restriction enzyme digestion and Southern blotting. DNA is extracted from blood 
samples from subjects A, B, and C. The DNA is digested by a restriction enzyme and then loaded 
on a gel. Electrophoresis separates the DNA fragments according to their size. The DNA is 
denatured and transferred to a solid membrane (Southern blot), where it is hybridized with a 
radioactive probe. Exposure to x-ray film (autoradiography) reveals specific DNA fragments 
(bands) of different sizes in individuals A, B, and C. 
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BOX 3-1 Genetic Engineering, Recombinant DNA, and Cloning 


In the last two decades, most of the lay public has acquired at 
least a passing familiarity with the terms "recombinant DNA," 
"cloning," and "genetic engineering." Indeed, these tech¬ 
niques lie at the heart of what is often called the "new 
genetics." 

Genetic engineering refers to the laboratory alteration of 
genes. An alteration that is of special importance in medical 
genetics is the creation of clones. Briefly, a clone is an identical 
copy of a DNA sequence. The following description outlines 
one approach to the cloning of human genes. 


Our goal is to insert a human DNA sequence into a rapidly 
reproducing organism so that copies (clones) of the DNA can 
be made quickly. One system commonly used for this purpose 
is the plasmid, which is a small, circular, self-replicating piece 
of DNA that resides in many bacteria. Plasmids can be removed 
from bacteria or inserted into them without seriously disrupting 
bacterial growth or reproduction. 

To insert human DNA into the plasmid, we need a way to cut 
DNA into pieces so that it can be manipulated. Restriction 
enzymes, discussed earlier in the text, perform this function 
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FIG 3-18 Recombinant DNA technology. Human and circular plasmid DNA are both cleaved by 
a restriction enzyme, producing sticky ends (1-3). This allows the human DNA to anneal and 
recombine with the plasmid DNA. Inserted into the plasmid DNA, the human DNA is now rep¬ 
licated when the plasmid is inserted into the Escherichia coii bacterium (4). 
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BOX 3-1 Genetic Engineering, Recombinant DNA, and Cloning—cont'd 


efficiently. The DNA sequence recognized by the restriction 
enzyme fcoRI, GAATTC, has the convenient property that its 
complementary sequence, CTTAAG, is the same sequence, 
except backward. Such sequences are called palindromes. 
When plasmid or human DNA is cleaved with fcoRI, the result¬ 
ing fragments have sticky ends. If human DNA and plasmid 
DNA are both cut with this enzyme, both types of DNA frag¬ 
ments contain exposed ends that can undergo complementary 
base pairing with each other. Then, when the human and 
plasmid DNA are mixed together, they recombine (hence the 
term recombinant DIMA). The resulting plasmids contain 
human DNA inserts. The plasmids are inserted back into bac¬ 
teria, where they reproduce rapidly through natural cell division. 
The human DNA sequence, which is reproduced along with 
the other plasmid DNA, is thus cloned (Fig. 3-18). 

The plasmid is referred to as a vector. Several other types 
of vectors may also be used as cloning vehicles, including 
bacteriophages (viruses that infect bacteria), cosmids (phage- 
plasmid hybrids capable of carrying relatively large DNA inserts), 
yeast artificial chromosomes (YACs; vectors that are inserted 
into yeast cells and that behave much like ordinary yeast chro¬ 
mosomes), bacterial artificial chromosomes (BACs), and 


human artificial chromosomes (see Chapters 8 and 13). 
Although plasmids and bacteriophages can accommodate only 
relatively small inserts (about 10 and 20 kb, respectively), 
cosmids can carry inserts of approximately 50 kb, and YACs 
can carry inserts up to 1000 kb in length. 

Cloning can be used to create the thousands of copies of 
human DNA needed for Southern blotting and other experi¬ 
mental applications. In addition, this approach is now used to 
produce genetically engineered therapeutic products, such as 
insulin, interferon, human growth hormone, clotting factor VIII 
(used in the treatment of hemophilia A, a coagulation disorder), 
and tissue plasminogen activator (a blood clot-dissolving 
protein that helps to prevent heart attacks and strokes). When 
these genes are cloned into bacteria or other organisms, the 
organism produces the human gene product along with its own 
gene products. In the past, these products were obtained from 
donor blood or from other animals. The processes of obtaining 
and purifying them were slow and costly, and the resulting 
products sometimes contained contaminants. Genetically engi¬ 
neered gene products are rapidly becoming a cheaper, purer, 
and more efficient alternative. 


undergoes complementary base pairing only with the corre¬ 
sponding complementary single-stranded DNA fragments 
on the blot identifying one or a few fragments from a specific 
portion of the DNA. To visualize the position on the blot at 
which the probe hybridizes, the blot is exposed to x-ray film, 
which darkens at the probe’s position due to the emission of 
radioactive particles from the labeled probe. These darkened 
positions are usually referred to as bands, and the film is 
termed an autoradiogram (Fig. 3-19). 

Southern blotting can be used in several ways. For 
example, it can detect insertions or deletions in DNA 
sequences, which cause specific fragments to become larger 
or smaller. If a disease-causing mutation alters a specific 
restriction site, as in the case of sickle cell disease (Fig. 3-20), 
this technique can be used as a cheap and efficient diagnostic 
tool. Because most disease-causing mutations do not affect 
restriction sites, this approach is somewhat limited, and 
other, newer techniques can be used. Finally, Southern blot¬ 
ting was instrumental in analyzing restriction fragment 
length polymorphisms (RFLPs), which are found through¬ 
out the human genome as a result of normal DNA sequence 
variation. These sequence variants were used to localize 
many important disease-causing genes, including those 
responsible for cystic fibrosis, Huntington disease, and type 
1 neurofibromatosis (see Chapter 8). 

Restriction enzymes can cut DNA into frag¬ 
ments, which are sorted according to their 
length by electrophoresis, transferred to a 
solid membrane (Southern blotting), and visu¬ 
alized through the use of labeled probes. This 
process can detect deletions or duplications of 
DNA, as well as RFLPs. 



FIG 3-19 An autoradiogram, showing the positions of a 
4.1-kb band and a 3.3-kb band. Each lane represents DNA 
from a subject in the family whose pedigree is shown above 
the autoradiogram. 

Tandem Repeat Polymorphisms 

The approach just described can detect polymorphisms that 
reflect the presence or absence of a restriction site. These 
polymorphisms have only two possible alleles, placing a limit 
on the amount of genetic diversity that can be seen. More 
diversity could be observed if a polymorphic system had 
many alleles, rather than just two. One such system exploits 
the microsatellites and minisatellites that exist throughout 
the genome. As discussed in Chapter 2, these are regions in 
which the same DNA sequence is repeated over and over, in 
tandem (Fig. 3-21). Microsatellites are typically composed of 
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FIG 3-20 Cleavage of p-globin DNA by the MstW restriction enzyme. Normal individuals have 
glutamic acid at position 6 of the P-globin polypeptide. Glutamic acid is encoded by the DNA 
sequence GAG. The sickle cell mutation results in the sequence GTG at this site instead of GAG, 
causing valine to be substituted for glutamic acid. The restriction enzyme MstW recognizes the 
DNA sequence CCTNAGG (the N signifies that the enzyme will recognize any DNA base, includ¬ 
ing G, in this position). Thus, MstW recognizes and cleaves the DNA sequence of the normal 
chromosome at this site as well as at the restriction sites on either side of it. The sickle cell 
mutation removes an Msfll recognition site, producing a longer, 1.3-kb fragment. The normal 
DNA sequence includes the restriction site (i.e., the sequence CCTGAG instead of CCTGTG), so 
a shorter, 1.1-kb fragment is produced. Therefore, on the autoradiogram, sickle cell homozygotes 
have a single 1.3-kb band, normal homozygotes have a single 1.1-kb band, and heterozygotes 
have both the 1.1-kb and the 1.3-kb bands. Because shorter fragments migrate farther on a gel, 
the two fragment sizes can easily be distinguished after hybridization of the blot with a probe 
containing DNA from the p-globin gene. Note that the banding pattern here, based on DNA 
sequence differences, resembles the banding pattern shown in Figure 3-15, which is based on 
hemoglobin amino acid sequences detected by protein electrophoresis. 


A Allele 1 ( ...TCC GTA... GTA... GTA... GTA... J ACTGTTA... ) 

B Allele 2 ( ^..TCC^GTA... GTA... GTA... GTA... GTA... GTA... GTA... GTA..!^ACTGTTaT^ 


FIG 3-21 Tandem repeat polymorphisms. Bands of differing length (A and B) are created by 
different numbers of tandem repeats in the DNA on the two copies of a chromosome. Following 
amplification and labeling of the region that contains the polymorphism, different fragment 
lengths are separated by electrophoresis and visualized on an autoradiogram. 
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units that are only 2 to 5 bp long, whereas minisatellites 
contain longer repeat units. The genetic variation measured 
is the number of repeats in a given region, which varies sub¬ 
stantially from individual to individual: a specific region 
could have as few as two or three repeats or as many as 20 or 
more. These polymorphisms can therefore reveal a high 
degree of genetic variation. Minisatellite polymorphisms are 
termed variable number of tandem repeats (VNTR), and 
micro satellite polymorphisms are termed short tandem 
repeats (STRs). The latter are especially easy to assay, and tens 
of thousands of them are distributed throughout the human 
genome. These properties make them useful for mapping 
genes by the process of linkage analysis, discussed in Chapter 
8. Both types of polymorphisms are useful in forensic appli¬ 
cations, such as paternity testing and the identification of 
criminal suspects (Box 3-2) (Fig. 3-22). 

VNTRs are a type of polymorphism that results 
from varying numbers of minisatellite repeats 
in a specific DNA region. STRs are a similar type 
of polymorphism that results from varying 


numbers of smaller, microsatellite repeats. 
Because VNTRs and STRs can have many dif¬ 
ferent alleles in a population, they are espe¬ 
cially useful in medical genetics and forensics. 

Single Nucleotide Polymorphisms 

The most numerous type of polymorphism in the human 
genome consists of variants at single nucleotide positions on 
a chromosome, or single nucleotide polymorphisms (SNPs). 
RFLPs, which are usually caused by single-base differences 
that occur only at restriction sites, are a subset of the more 
general set of SNPs. These polymorphisms, when they occur 
in functional DNA sequences, can cause inherited diseases, 
although most are harmless. Increasingly, they are being 
detected by microarray and direct sequencing methods, 
which are discussed later in this chapter. 

Copy Number Variants 

Throughout the human genome, there are sections of DNA 
that vary in their number of copies from one individual 
to another. These copy number variants (CNVs), typically 
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BOX 3-2 DNA Profiles in the Forensic Setting 


Because of the large number of polymorphisms observed in 
the human genome, it is virtually certain that each of us is 
genetically unique (with the exception of identical twins, whose 
DNA sequences are nearly always identical). It follows that 
genetic variation could be used to identify individuals, much as 
a conventional fingerprint does. Because DNA can be found in 
any tissue sample, including blood, semen, and hair,* genetic 
variation has substantial potential in forensic applications 
(e.g., criminal cases, paternity suits, identification of accident 
victims). STBs, with their many alleles, are very useful in estab¬ 
lishing a highly specific DNA profile. 

The principle underlying a DNA profile is quite simple. If we 
examine enough polymorphisms in a given individual, the prob¬ 
ability that any other individual in the population has the same 
allele at each examined locus becomes extremely small. DNA 
left at the scene of a crime in the form of blood or semen, for 
example, can be typed for a series of STBs. Because of the 
extreme sensitivity of the PCB approach, even a tiny sample 
several years old can yield enough DNA for laboratory analysis 
(although extreme care must be taken to avoid contamination 
when using PCB with such samples). The detected alleles are 
then compared with the alleles of a suspect. If the alleles in 
the two samples match, the suspect is implicated (Fig. 3-22). 


A key question is whether another person in the general 
population might have the same alleles as the suspect. Could 
the DNA profile then falsely implicate the wrong person? In 
criminal cases, the probability of obtaining an allele match with 
a random member of the population is calculated. Because of 
the high degree of allelic variation in STBs, this probability is 
usually very small. The use of 13 STBs, which is now common 
practice, yields random match or more probabilities in the 
neighborhood of 1 in 1 trillion. Provided that a large enough 
number of loci are used under well-controlled laboratory condi¬ 
tions, and provided that the data are collected and evaluated 
carefully, DNA profiles can furnish highly useful forensic evi¬ 
dence. DNA profiles are now used in many thousands of crimi¬ 
nal court cases each year. 

Although we tend to think of such evidence in terms of iden¬ 
tifying the guilty party, it should be kept in mind that when a 
match is not obtained, a suspect may be exonerated. In addi¬ 
tion, postconviction DNA testing has resulted in the release of 
hundreds of persons who were wrongly imprisoned. Thus, 
DNA profiles can also benefit the innocent. 



FIG 3-22 DNA profiles. A, An autoradiogram shows that the band pattern of DNA from suspect 
1 does not match the DNA taken from the crime scene (3), whereas the band pattern from 
suspect 2 does match. In practice, multiple STBs are assayed to reduce the possibility of a false 
match. B, STBs are now commonly assayed using a capillary gel apparatus. The resulting STB 
profile is displayed as an electropherogram, in which the locations of peaks indicate the lengths 
(in base pairs) of each STB allele. (Panel A courtesy Jay Henry, Criminalistics Laboratory, Depart¬ 
ment of Public Safety, State of Utah.) 


Even fingerprints left at a crime scene sometimes contain enough DNA for PCB amplification and DNA profiling. 
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FIG 3-23 A, Restriction fragment length polymorphisms (RFLPs) result from DNA sequence 
differences that occur at restriction sites in the human genome. The locations of these sites are 
identified by hybridizing restriction fragments with cloned probes. B, Tandem repeats consist of 
short segments of DNA (microsatellites) or somewhat longer segments (minisatellites whose 
lengths can be 14 to 500 bp) that are repeated over and over, in tandem. C, Copy number vari¬ 
ants (CNVs) represent differences in the numbers of larger repeated segments of DNA (>1000 bp 
to 2 million bp). D, Single nucleotide polymorphisms (SNPs) are single-base variations in the 
genome. 


defined as DNA sequences larger than 1000 base pairs, may 
be present in zero to more than a dozen copies in a haploid 
genome. Each human is heterozygous for at least 100 CNVs 
(i.e., he or she inherited a different number of copies from 
the mother than from the father). Although CNVs are much 
less numerous than SNPs, their large individual size means 
that they account for several million total base pair differ¬ 
ences between any pair of haploid DNA sequences (roughly 
the same amount as SNPs). Some CNVs have been shown to 
be associated with inherited diseases, and some are associated 
with response to specific therapeutic drugs. Figure 3-23 high¬ 
lights the differences among RFLPs, tandem repeats, SNPs, 
and CNVs. 

SNPs are the most common type of variation 
in the human genome. CNVs consist of dif¬ 
ferences in the number of repeated DNA 
sequences longer than 1000 bp. 

DNA Amplification Using the Polymerase 
Chain Reaction 

Because the DNA molecule is tiny, it is not possible to visual¬ 
ize DNA variation (i.e., different base pairs) directly. All 


methods of assessing DNA variation involve indirect assess¬ 
ment, as in the use of labeled probes to bind to specific DNA 
regions in Southern blotting. 

Nearly all methods of visualizing DNA variation require 
indirect labeling of DNA. To observe the labels, multiple 
copies must be made. For example, bacteria can be used to 
make thousands of cloned copies of the labeled probes used 
in Southern blotting. However, this process (see Box 3-1) is 
time-consuming, often requiring several days or more, and it 
typically requires a relatively large amount of DNA from 
the subject (several micrograms). An alternative process, 
the polymerase chain reaction (PCR), was developed in the 
mid-1980s and has made the detection of genetic variation at 
the DNA level much more efficient. Essentially, PCR is an 
artificial means of replicating a short, specific DNA sequence 
(several kb or less) very quickly, so that millions of copies of 
the sequence are made. 

The PCR process, summarized in Figure 3-24, requires 
four components: 

• Two primers, each consisting of 15 to 20 bases of DNA. 
These small DNA sequences are termed oligonucleotides 
{oligo means “a few”). The primers correspond to the DNA 
sequences immediately adjacent to the sequence of interest 
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FIG 3-24 The polymerase chain reaction process. Genomic 
DNA is first heated and denatured to form single strands. In 
the annealing phase, the DNA is cooled, allowing hybridiza¬ 
tion with primer sequences that flank the region of interest. 
Then the reaction is heated to an intermediate primer exten¬ 
sion temperature, and in this step the DNA polymerase adds 
free bases in the 3' direction along each single strand, starting 
at the primer. Blunt-ended DNA fragments are formed, and 
these provide a template for the next cycle of heating and 
cooling. Repeated cycling produces a large number of DNA 
fragments bounded on each end by the primer sequence. 


(such as a sequence that contains a tandem repeat poly¬ 
morphism or a mutation that causes disease). The oligo¬ 
nucleotide primers are synthesized using a laboratory 
instrument. 

• DNA polymerase. A thermally stable form of this enzyme, 
initially derived from the bacterium Thermus aquaticus, 


performs the vital process of DNA replication (here 

termed primer extension). 

• A large number of free DNA nucleotides. 

• Genomic DNA from an individual. Because of the extreme 

sensitivity of PCR, the quantity of this DNA can be very 

small. 

The genomic DNA is first heated to a relatively high tem¬ 
perature (approximately 95° C) so that it denatures and 
becomes single-stranded. As the single-stranded DNA is then 
cooled to a temperature of approximately 35° C to 65° C, it is 
exposed to large quantities of single-stranded primers, which 
hybridize, or anneal, to a specific location in the genomic 
DNA that contains the appropriate complementary bases. 
The DNA is then heated to an intermediate temperature 
(70°C to 75°C). In the presence of a large number of free 
DNA bases, a new DNA strand is synthesized by the DNA 
polymerase at this temperature, extending from the primer 
sequence. The newly synthesized DNA consists of a double 
strand that has the 5' end of the primer at one end, followed 
by the bases added through primer extension by DNA poly¬ 
merase. This double-stranded DNA is heated to a high tem- 
peratureagain,causingitto denature. Theheating-and-cooling 
cycle is then repeated. Now, the newly synthesized DNA 
serves as the template for further synthesis. As the cooling- 
and-heating cycles are repeated, the primer-bounded DNA 
products are amplified geometrically: the number of copies 
doubles in each cycle (i.e., 2, 4, 8, 16, etc.). This is why the 
process is termed a chain reaction. Typically, the cycles are 
repeated 20 to 30 times, producing millions of copies of the 
original DNA. In summary, the PCR process consists of three 
basic steps: DNA denaturing at high temperature, primer 
hybridization at a low temperature, and primer extension at 
an intermediate temperature. The result is a product that 
consists almost entirely of a specific DNA sequence. 

Because each heating-and-cooling cycle requires only a 
few minutes or less, a single molecule of DNA can be ampli¬ 
fied to make millions of copies in only a few hours. Because 
the procedure is simple and entirely self-contained, inexpen¬ 
sive machines have been developed to automate it com¬ 
pletely. Once the DNA is amplified, it can be analyzed in a 
variety of ways. 

PCR has several advantages over older techniques. First, it 
can be used with extremely small quantities of DNA (usually 
nanogram amounts, as opposed to the micrograms required 
for cloning). The amount of DNA in a several-year-old blood 
stain, a single hair, or even the back of a licked postage stamp 
is often sufficient for analysis. Second, because it does not 
require cloning, the procedure is much faster than older 
techniques. Genetic testing for sickle cell disease, for example, 
can be done in a single day with PCR. Finally, because PCR 
can make large quantities of very pure DNA, it is less often 
necessary to use radioactive probes to detect specific DNA 
sequences or mutations. Instead, safer, nonradioactive sub¬ 
stances, such as biotin, can be used. 

PCR does have some disadvantages. First, primer synthesis 
requires identification of the DNA sequence flanking the 
DNA of interest. If this sequence is ambiguous, other tech¬ 
niques must be used. Second, the extreme sensitivity of PCR 
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makes it susceptible to contamination in the laboratory. A 
number of precautions are commonly taken to guard against 
contamination. Finally, because it can be difficult to apply 
PCR to sequences longer than a few kilobases, it is not typi¬ 
cally useful for detecting larger deletions (i.e., it is difficult or 
impossible to amplify the longer, normal sequence). Southern 
blotting or other techniques are used instead. 

Because PCR is such a powerful and versatile technique, it 
is now used extensively in genetic disease diagnosis, forensic 
medicine, and evolutionary genetics. PCR is so sensitive that 
it has been used to analyze DNA from ancient mummies and 
even from Neanderthal specimens more than 30,000 years 
old. Analysis of these specimens showed that modern humans 
are genetically distinct from Neanderthals but that most 
modern humans have a small amount of Neanderthal DNA 
(1-3%) in their genomes. 

PCR provides a convenient and efficient means 
of making millions of copies of a short DNA 
sequence. Heating-and-cooling cycles are used 
to denature DNA and then build new copies of 
a specific, primer-bounded sequence. Because 
of its speed and ease of use, this technique is 
now widely used for assessing genetic varia¬ 
tion, for diagnosing genetic diseases, and for 
performing forensic investigations. 

DNA Sequencing 

In many genetic studies, a primary goal is to determine the 
actual array of DNA base pairs that makes up a gene or part 
of a gene. Such a DNA sequence can indicate a great deal 
about the nature of a specific mutation, the function of a 
gene, and the gene’s degree of similarity to other known 
genes. We first discuss a technique that has been widely used 
to determine DNA sequences. 

The dideoxy method of DNA sequencing, invented by 
Frederick Sanger, makes use of chain-terminating dideoxy- 
nucleotides. These are chemically quite similar to ordinary 
deoxynucleotides, except that they are missing one hydroxyl 
group. This prevents the subsequent formation of phospho- 
diester bonds with free DNA bases. Thus, although dideoxy- 
nucleotides can be incorporated into a growing DNA helix, 
no additional nucleotides can be added once they are included. 

Four different dideoxynucleotides are used, each corre¬ 
sponding to one of the four nucleotides (A, C, G, and T). The 
single-stranded DNA whose sequence we wish to determine 
is mixed with labeled primers, DNA polymerase, ordinary 
nucleotides, and one type of dideoxynucleotide (Fig. 3-25). 
The primer hybridizes to the appropriate complementary 
position in the single-stranded DNA, and DNA polymerase 
adds free bases to the growing DNA molecule, as in the PCR 
process. At any given position, either an ordinary nucleotide 
or the corresponding dideoxynucleotide may be incorporated 
into the chain; this is a random process. However, once a 
dideoxynucleotide is incorporated, the chain is terminated. 
The procedure thus yields DNA fragments of varying length, 
each ending with the same dideoxynucleotide. 


The DNA fragments can be separated according to length 
by electrophoresis, as discussed previously. Four different 
sequencing reactions are run, one for each base. The frag¬ 
ments obtained from each reaction are electrophoresed side 
by side on the same gel, so that the position of each fragment 
can be compared. Because each band corresponds to a DNA 
chain that terminates with a unique base, the DNA sequence 
can be read by observing the order of the bands on the gel 
after autoradiography or other detection methods. Several 
hundred base pairs can usually be sequenced in one reaction 
series. 

DNA sequencing can be accomplished using 
the dideoxy method. This method depends on 
the fact that dideoxynucleotides behave in a 
fashion similar to ordinary deoxynucleotides, 
except that once they are incorporated into the 
DNA chain, they terminate the chain. They 
thus mark the positions of specific bases. 

It should be apparent that this method of sequencing DNA 
is a relatively slow, laborious, and error-prone process. Nowa¬ 
days, most strategies for DNA sequencing are automated and 
use fluorescent, chemiluminescent, or colorimetric detection 
systems. The use of fluorochrome-labeled primers or dide¬ 
oxynucleotides is the most popular method, partly because it 
was easily adapted for rapid automation. 

Typically, a DNA template is sequenced using a method 
similar to the primer extension step in PCR. Each of the four 
different nucleotides can be labeled with a fluorochrome 
that emits a distinct spectrum of light. The fluorochrome- 
labeled reaction products are electrophoresed through a 
very thin polyacrylamide gel or through a thin capillary 
tube; as they migrate past a window, they are excited by a 
beam of light from a laser. The emitted light is captured by 
a digital camera for translation into an electronic signal, 
and a composite gel image is generated. This gel image is 
analyzed to produce a graph in which each of the four dif¬ 
ferent nucleotides is depicted by a different-colored peak 
(Fig. 3-26). Automated sequencers can also be adapted to 
assay STRs, single-nucleotide polymorphisms, and other 
types of polymorphisms. 

By using computers and advanced automated technology, 
approaches such as these have greatly increased the potential 
speed of DNA sequencing. These techniques have permitted 
the completion of the entire 3-billion-bp human DNA 
sequence. 

Automated DNA sequencing, using fluorescent 
labels and laser detection, greatly increases the 
speed and efficiency of the sequencing process. 

High-Throughput DNA Sequencing 

During the past several years, the development of new high- 
throughput DNA sequencing technology has decreased the 
cost of DNA sequencing by many orders of magnitude (this 
technology is also sometimes termed “next generation” or 
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FIG 3-25 DNA sequencing by the dideoxy (Sanger) method. Labeled primer is added to the 
single-stranded DNA whose sequence is unknown. DNA polymerase adds free bases to the 
single strand, using complementary base pairing. Four different reactions are carried out, cor¬ 
responding to the four dideoxynucleotides (ddATP, ddCTP, ddGTP, and ddTTP). Each of these 
terminates the DNA sequence whenever it is incorporated in place of the normal deoxynucleotide 
(dATP, dCTP, dGTP, and dTTP, corresponding to the bases A, C, G, and T, respectively). The 
process results in fragments of varying length, which can be separated by electrophoresis. The 
position of each fragment is indicated by the emission of radioactive particles from the label, 
which allows the DNA sequence to be read directly. 


“massively parallel” sequencing). In one common approach, 
genomic DNA is chopped at random into small segments, 
typically 100 to several hundred bp in size (Fig. 3-27). Short 
synthetic DNA sequences, termed adapters, are joined to the 
ends of the genomic DNA fragments. These double-stranded 
DNA fragments are separated into single strands and then 
attached to a solid surface, such as a glass slide. Each indi¬ 
vidual DNA fragment is amplified by PCR into a cluster of 
thousands of identical copies, using the adapters as primer 
sequences (the multiple copies provide a signal strong enough 


to be visualized by a camera, as described below). A sequenc¬ 
ing reaction then occurs, in which these fragment clusters 
serve as templates for synthesizing complementary sequences. 
Similar to the Sanger sequencing process described previ¬ 
ously, new complementary bases (to which a base-specific 
fluorescent label is attached) are added one at a time. The 
fluorescent signal from each cluster is recorded by a camera, 
revealing the base-pair sequence of each fragment. The key 
advantage of this approach is that millions of different DNA 
fragments are sequenced simultaneously, in contrast to older 
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CTCCAGCCGCC 



FIG 3-26 Analyzed data from a single DNA template 
sequenced on an automated DNA sequencer. Peaks of dif¬ 
ferent colors represent the identity and relative location of 
different nucleotides in the DNA sequence. For example, the 
leftmost peak is blue and identifies the position of a cytosine. 
The next peak is red, indicating the presence of a thymine. 
This base-calling continues until the end of the DNA template 
is reached (typically a few hundred base pairs). 


methods in which only a few dozen fragments are sequenced 
at a time. 

When the sequencing reactions are completed, millions 
of short DNA sequences must be stitched together to form 
a completed assembly. Each short sequence is compared 
(aligned) to a fully assembled reference sequence, typically 
the first published human genome or a consensus sequence 
of multiple genomes. Each base pair of the reference genome 
is covered by multiple short sequences (“reads”) so that it is 
sequenced 30 to 60 or more times. A consensus base call is 
made from the short sequences. Since sequencing errors will 
appear in very few reads, they will be removed, and true 
mutations and polymorphisms will remain as differences 
from the reference sequence. Genomic regions that are 
present in many copies (like mobile elements) or have 
repeated sequences within them (like STRs) can be a chal¬ 
lenge for this method. 

There are many variations on this theme, including tech¬ 
niques in which single DNA molecules are sequenced, rather 
than clusters of copies. In many applications (see Chapter 
13), the protein-coding exons are of primary interest, so DNA 
probes can be used to “pull down” only this portion of the 
genome (termed the exome) for sequencing. These approaches 
have been adapted to sequence RNA and to assess methyla- 
tion patterns in the genome. 

Because of this new technology, the sequencing of a human 
genome, which required months of time and millions of 
dollars just a few years ago, can now be completed in several 
hours at a cost of one to several thousand dollars. Many 
thousands of human genomes have been sequenced using 
high-throughput sequencing, leading to the discovery of 
disease-causing genes and to more effective genetic testing 
(see Chapters 8 and 13). 

Detection of Mutations at the DNA Level 

The detection of mutations or polymorphisms in DNA 
sequences is often a critical step in understanding how a 
gene causes a specific disease. New molecular methods have 
spawned a number of techniques for detecting DNA sequence 
variation. Many of the techniques summarized in Table 3-4 
can provide rapid and efficient screening for the presence 
of mutations. These methods can indirectly indicate the exis¬ 
tence and location of a mutation, after which the DNA in the 
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ACTAAAAGCCTACGCGCGCTACGACATCACGAGCGCGCGCGGAACTAAAGC 
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FIG 3-27 Fligh-throughput DNA sequencing. In one com¬ 
monly used approach, whole genomic DNA, typically obtained 
from a blood or saliva sample, is mechanically sheared into 
small fragments (100 to several hundred bp). Standard 
adapter sequences are attached (ligated) to the ends of the 
DNA fragments, which are then converted from double- 
stranded to single-stranded form and attached to a glass 
slide. The attached adapter sequences serve as primers for 
PCR amplification, creating thousands of copies of each DNA 
fragment. Clusters of identical single-stranded fragments are 
templates for a sequencing reaction in which complementary 
bases are added one at a time. These complementary bases 
are labeled by a fluorescent dye specific for A, C, G, or T, 
allowing the bases to be visualized by a camera. The sequenc¬ 
ing reactions occur in parallel, rapidly creating millions of 
short sequence "reads." The sequence reads, which typically 
cover each base 40 to 50 times, are aligned to a reference 
genome sequence (shown at the bottom of the figure in 
orange) to recreate the individual's genome sequence. 
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TABLE 3-4 Methods of Mutation Detection 


TECHNIQUE 

BRIEF DESCRIPTION 

APPLICATION 

Southern blotting 

Digestion of test DNA with restriction enzyme; 
resolution of fragments with agarose gel 
electrophoresis; transfer of DNA to nylon membrane 
and hybridization of labeled probe to DNA fragments 

Detection of insertions, deletions, 
rearrangements; ordering of DNA 
fragments into physical map 

Analysis of PCR 

PCR products are sorted by size using electrophoresis 

Detection of small insertions and deletions 

product size 

on an agarose or polyacrylamide gel 

and triplet repeat expansions 

Direct DNA 

Determination of linear order of nucleotides of test DNA; 

Detection of insertions, deletions, point 

sequencing 

specific nucleotide detected by chemical cleavage, 
dideoxy-chain termination, or fluorochrome dye 

mutations, rearrangements 

DNA mismatch 

Hybridization of a labeled probe to test DNA; cleavage of 

Detection of small insertions or deletions. 

cleavage 

DNA at site of base pairing mismatch 

point mutations 

Allele-specific 

oligonucleotide 

Preferential hybridization of labeled probe to test DNA 
with uniquely complementary base composition 

Detection of alleles of known composition 

(ASO) hybridization 

Multiplex ligation- 

Ligation of DNA fragments after hybridization of probes 

Detection of deletions and duplications of 

dependent probe 

specific to a region 

exons or whole genes 

amplification (MLPA) 

Mass spectrometry 

Detection of physical mass of sense and antisense 
strands of test DNA 

Detection of small insertions or deletions, 
point mutations 

DNA microarray 

Hybridization of test DNA to arrays of oligonucleotides 

Detection of SNPs, CNVs, expression 

hybridization 

ordered on silicone chip or glass slide 

differences 

Protein truncation 

Test DNA used to make complementary DNA (cDNA) 
by RT-PCR with 5' primer containing T7 promoter; 
cDNA translated and product resolved by SDS-PAGE 

Detection of frameshift, splice-site, or 
nonsense mutations that truncate the 
protein product 


RT-PCR, Reverse transcriptase-polymerase chain reaction; SDS-PAGE, sodium dodecyl sulfate polyacrylamide gel electrophoresis. 


indicated region can be sequenced to identify the specific 
mutation. Direct sequencing of DNA is a useful and accurate 
means of detecting mutations and is regarded as the definitive 
method of identifying and verifying mutations. As it becomes 
less expensive, direct DNA sequencing is being used with 
increasing frequency. 

A great deal of progress has heen made in fabricating DNA 
microarrays (also known as DNA chips) and using them for 
detection of DNA variation (Fig. 3-28). To make a DNA 
microarray, robots place single-stranded oligonucleotides on 
a small glass slide. A single slide (1 cm^) can contain millions 
of different oligonucleotides. These oligonucleotides consist 
of normal DNA sequences as well as DNA sequences that 
contain known disease-causing mutations. Fluorescently 
labeled single-stranded DNA from a subject is hybridized 
with the oligonucleotides on the slide to determine whether 
the DNA hybridizes with the normal or with the mutation- 
containing oligonucleotides, and the pattern of hybridization 
signals is analyzed by a computer. With current technology, 
enough probes can be placed on a single microarray to 
analyze variation in one to several million SNPs in an indi¬ 
vidual. Microarrays are also used to examine copy number 
variants, methylation patterns in a person’s genome, and 
genetic variation in various pathogenic organisms. A key dif¬ 
ference between microarrays and the methods summarized 
in the preceding paragraph is that microarrays typically test 
for known mutations that are incorporated in oligonucle¬ 
otide probes. A rare, previously unidentified mutation cannot 
be detected by conventional microarrays. 


Still another application of DNA microarrays is to deter¬ 
mine which genes are being expressed (i.e., transcribed) in a 
given tissue sample (e.g., from a tumor). mRNA from the 
tissue is extracted and used as a template to form a comple¬ 
mentary DNA sequence, which is then hybridized on the slide 
with oligonucleotides representing many different genes. The 
pattern of positive hybridization signals indicates which 
genes are expressed in the tissue sample. The DNA microar¬ 
ray approach offers the extraordinary speed, miniaturization, 
and accuracy of computer-based mutation analysis. Tests for 
specific mutations, an important aspect of genetic diagnosis, 
are discussed further in Chapter 13. 

Many techniques can be used to detect muta¬ 
tions at the DNA sequence level. These include 
Southern blotting, direct DNA sequencing, and 
microarray analysis. Microarrays are used in 
mutation detection, gene expression analysis, 
and a wide variety of other applications. 

GENETIC VARIATION IN POPULATIONS 

Although mutation is the ultimate source of genetic varia¬ 
tion, it cannot alone account for the substantial differences 
in the incidence of many genetic diseases among different 
human populations. Why, for example, is sickle cell disease 
seen in approximately 1 of every 600 African Americans, but 
seldom in northern Europeans? Why is cystic fibrosis 40 times 
more common in Europeans than in Asians? In this section. 
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FIG 3-28 A, Schematic diagram of a microarray. Oligonucleotides are placed or synthesized on 
a chip. They are then exposed to labeled DNA from a subject. Hybridization occurs only if the 
oligonucleotide contains a DNA sequence that is complementary to that of the subject's DNA. 
The fluorescent label marks the location of the complementary oligonucleotide sequence on the 
chip. B, A microarray containing 36,000 oligonucleotides. This microarray was exposed to DNA 
from normal fibroblasts {red, see arrows) and fibroblasts from a patient with Niemann-Pick 
disease, type C (green). Arrows point to regions in which there was a strong hybridization signal 
with either normal or disease DNA. This microarray was used to search for genes that are highly 
expressed In the fibroblasts of patients. 


concepts are introduced that explain these differences. The 
study of genetic variation in populations is an important 
focus of population genetics. 

Basic Concepts of Probability 

Probahility plays a central role in genetics because it helps us 
to understand the transmission of genes through generations, 


and it helps to explain and analyze genetic variation in popula¬ 
tions. It also aids in risk assessment, an important part of 
medical genetics. For example, the physician or genetic coun¬ 
selor commonly informs couples about their risk of producing 
a child with a genetic disorder. A probability is defined as the 
proportion of times that a specific outcome occurs in a series 
of events. Thus, we may speak of the probability of obtaining 
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a 4 when a die is tossed, or the prohahility that a couple will 
produce a son rather than a daughter. Because probabilities 
are proportions, they lie between 0 and 1, inclusive. 

During meiosis, one member of a chromosome pair is 
transmitted to each sperm or egg cell. The probability that a 
given member of the pair will be transmitted is 1/2, and the 
probability that the other member of the pair will be trans¬ 
mitted is also 1/2. (Note that the probabilities of all possible 
events must add to 1 for any given experiment.) Because this 
situation is directly analogous to coin tossing, in which the 
probabilities of obtaining heads or tails are each 1/2, we will 
use coin tossing as our illustrative example. 

When a coin is tossed repeatedly, the outcome of each toss 
has no effect on subsequent outcomes. Each event (toss) is 
said to be independent. Even if we have obtained 10 heads 
in a row, the probability of obtaining heads or tails on the 
next toss remains 1/2. Similarly, the probability that a parent 
will transmit one of the two alleles at a locus is independent 
from one reproductive event to the next. 

The independence principle allows us to deduce two 
fundamental concepts of probability, the multiplication 
rule and the addition rule. The multiplication rule states 
that if two trials are independent, then the probability of 
obtaining a given outcome in both trials is the product of 
the probabilities of each outcome. For example, we may wish 
to know the probability of obtaining heads on both tosses 
of a fair coin. Because the tosses are independent events, this 
probability is given by the product of the probabilities of 
obtaining heads in each individual toss: 1/2 X 1/2 = 1/4. 
Similarly, the probability of obtaining two tails in a row is 
1/2 X 1/2 = 1/4. 

The multiplication rule can be extended for any number 
of trials. Suppose a couple wants to know the probability that 
all three of their planned children will be girls. Because the 
probability of producing a girl is approximately 1/2, and 
because reproductive events are independent of one another, 
the probability of producing three girls is 1/2 x 1/2 x 1/2 = 
1/8. However, if the couple has already produced two girls 
and then wants to know the probability of producing a third 
girl, it is simply 1/2. This is because the previous two events 
are no longer probabilities; they have actually occurred. 
Because of independence, these past events have no effect on 
the outcome of the third event. 

The addition rule states that if we want to know the prob¬ 
ability of either one outcome or another, we can simply add 
the respective probabilities together. For example, the prob¬ 
ability of getting two heads in a row is 1/2 x 1/2, or 1/4, and 
the probability of getting two tails in a row is the same. The 
probability of getting either two heads or two tails in a total 
of two tosses is the sum of the probabilities: 1/4 + 1/4 = 1/2. 
As another example, imagine that a couple plans to have 
three children, and they have a strong aversion to having 
three children all of the same sex. They can be reassured 
somewhat by knowing that the probability of producing 
three girls or three boys is only 1/8 -I- 1/8, or 1/4. The prob¬ 
ability that they will have some combination of boys and 
girls is 3/4 because the sum of the probabilities of all pos¬ 
sible outcomes must add up to 1. 


Basic probability enables us to understand 
and estimate genetic risks and to understand 
genetic variation among populations. The mul¬ 
tiplication rule is used to estimate the probabil¬ 
ity that two events will occur together. The 
addition rule is used to estimate the probabil¬ 
ity that one event or another will occur. 

Gene and Genotype Frequencies 

The prevalence of many genetic diseases varies considerably 
from one population to another. The concepts of genotype 
frequency and gene frequency help us to measure and 
understand population variation in the incidence of genetic 
disease. 

Imagine that we have typed 200 persons in a population 
for the MN blood group. This blood group, which is encoded 
by a locus on chromosome 2, has two major alleles, labeled 
M and N. In the MN system, the effects of both alleles can be 
observed in the heterozygote. M and N are therefore said to 
be codominant: The heterozygote can be distinguished from 
both homozygotes. Any individual in the population can have 
one of three possible genotypes (recall that the genotype is 
one’s genetic makeup at a locus): He or she could be homo¬ 
zygous for M (genotype MM), heterozygous (MN), or homo¬ 
zygous for N (NN). After typing each person in our sample, 
we find the following distribution of genotypes: MM, 64; 
MN, 120; NN, 16. The genotype frequency is obtained simply 
by dividing each genotype count by the total number of sub¬ 
jects. The frequency of MM is 64/200, or 0.32; the frequency 
of MN is 120/200, or 0.60; and the frequency of NN is 16/200, 
or 0.08. The sum of these frequencies must equal 1. 

The gene frequency for each allele, M and N, can be 
obtained here by the process of gene counting. Each MM 
homozygote has two M alleles, and each heterozygote has one 
M allele. Similarly, NN homozygotes have two N alleles, and 
heterozygotes have one N allele. In the example described, 
there are 

(64 X 2) -I- 120 = 248/W alleles 

(16x2)-1-120 = 152 A/alleles 

In total, there are 400 alleles at the MN locus (i.e., twice 
the number of subjects, because each subject has two alleles). 
To obtain the frequency of M, we divide the number of M 
alleles by the total number of alleles at that locus: 248/400 = 
0.62. Likewise, the frequency of N is 152/400, or 0.38. The 
sum of the two frequencies must equal 1. 

Gene and genotype frequencies specify the 
proportions of each allele and each genotype, 
respectively, in a population. Under simple 
conditions these frequencies can be estimated 
by direct counting. 

The Hardy-Weinberg Principle 

The example given for the MN locus presents an ideal 
situation for gene frequency estimation because, owing to 
co-dominance, the three genotypes can easily be distin¬ 
guished and counted. What happens when one of the 
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homozygotes is indistinguishable from the heterozygote (i.e., 
when there is dominance)? Here the basic concepts of prob¬ 
ability can be used to specify a predictable relationship 
between gene frequencies and genotype frequencies. 

Imagine a locus that has two alleles, labeled A and a. 
Suppose that, in a population, we know the frequency of allele 
A, which we will call p, and the frequency of allele a, which 
we will call q. From these data, we wish to determine the 
expected population frequencies of each genotype, AA, Aa, 
and aa. We will assume that individuals in the population 
mate at random with regard to their genotype at this locus 
(random mating is also referred to as panmixia). Thus, the 
genotype has no effect on mate selection. If men and women 
mate at random, then the assumption of independence is 
fulfilled. This allows us to apply the addition and multiplica¬ 
tion rules to estimate genotype frequencies. 

Suppose that the frequency, p, of allele A in our population 
is 0.7. This means that 70% of the sperm cells in the popula¬ 
tion must have allele A, as must 70% of the egg cells. Because 
the sum of the frequencies p and q must be 1, 30% of the egg 
and sperm cells must carry allele a (i.e., q = 0.30). Under 
panmixia, the probability that a sperm cell carrying A will 
unite with an egg cell carrying A is given by the product of 
the gene frequencies: p X p = p^ = 0.49 (multiplication rule). 
This is the probability of producing an offspring with the AA 
genotype. Using the same reasoning, the probability of pro¬ 
ducing an offspring with the aa genotype is given hj qx q = 
cf =0.09. 

What about the frequency of heterozygotes in the popula¬ 
tion? There are two ways a heterozygote can be formed. Either 
a sperm cell carrying A can unite with an egg carrying a, or 
a sperm cell carrying a can unite with an egg carrying A. The 
probability of each of these two outcomes is given by the 
product of the gene frequencies, pq. Because we want to know 
the overall probability of obtaining a heterozygote (i.e., the 
first event or the second), we can apply the addition rule, 
adding the probabilities to obtain a heterozygote frequency 
of 2pq. These operations are summarized in Figure 3-29. The 
relationship between gene frequencies and genotype frequen¬ 
cies was established independently by Godfrey Hardy and 
Wilhelm Weinberg and is termed the Hardy-Weinberg 
principle. 

As already mentioned, this principle can be used to esti¬ 
mate gene and genotype frequencies when dominant homo¬ 
zygotes and heterozygotes are indistinguishable. This is often 
the case for recessive diseases such as cystic fibrosis. Only the 
affected homozygotes, with genotype aa, are distinguishable. 
The Hardy-Weinberg principle tells us that the frequency of 
aa should be q^. For cystic fibrosis in the European popula¬ 
tion, q^ = 1/2500 (i.e., the prevalence of the disease among 
newborns). To estimate q, we take the square root of both 
sides of this equation: q = ^^IjlbOO = 1/50 = 0.02. Because p + 
q=l,p = 0.98. We can then estimate the genotype frequencies 
of AA and Aa. The latter genotype, which represents hetero¬ 
zygous carriers of the disease allele, is of particular interest. 
Because p is almost 1.0, we can simplify the calculation by 
rounding p up to 1.0 without a significant loss of accuracy. 
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FIG 3-29 The Hardy-Weinberg principle. The population fre¬ 
quencies of genotypes AA, Aa, and aa are predicted on the 
basis of gene frequencies (p and q). It is assumed that the 
gene frequencies are the same in males and females. 


We then find that the frequency of heterozygotes is 2pq = 
2q = 2/50 = 1/25. This tells us something rather remarkable 
about cystic fibrosis and about recessive diseases in general. 
Whereas the incidence of affected homozygotes is only 1 in 
2500, heterozygous carriers of the disease gene are much 
more common (1 in 25 individuals). The vast majority of 
recessive disease alleles, then, are effectively “hidden” in the 
genomes of heterozygotes. 



Under panmixia, the Hardy-Weinberg princi¬ 
ple specifies the relationship between gene 
frequencies and genotype frequencies. It is 
useful in estimating gene frequencies from 
disease prevalence data and in estimating the 
incidence of heterozygous carriers of recessive 
disease genes. 


Causes of Genetic Variation 

Mutation is the source of all genetic variation, and new 
genetic variants can be harmful, can be beneficial, or can have 
no effect whatsoever. Natural selection is often described as 
the “editor” of genetic variation. It increases the population 
frequency of favorable mutations (i.e., those who carry the 
mutation will produce more surviving offspring), and it 
decreases the frequency of variants that are unfavorable in a 
given environment (i.e., gene carriers produce fewer surviv¬ 
ing offspring). Typically, disease-causing mutations are con¬ 
tinually introduced into a population through the error 
processes described earlier. At the same time, natural selec¬ 
tion removes these mutations. 

Certain environments, however, can confer a selective 
advantage for a disease mutation. Sickle cell disease again 
provides an example. As discussed previously, persons who 
are homozygous for the sickle cell mutation are much more 
likely to die early. Heterozygotes ordinarily have no particular 
advantage or disadvantage. However, it has been shown that 
sickle cell heterozygotes have a distinct survival advantage in 
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FIG 3-30 Correspondence between the 
frequency of the sickle cell allele and 
the distribution of Plasmodium falciparum 
malaria. 


Frequency of Distribution of 
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environments in which Plasmodium falciparum malaria is 
common (e.g., west-central Africa) (Fig. 3-30). Because the 
malaria parasite does not survive well in the erythrocytes of 
sickle cell heterozygotes, these persons are less likely to 
succumb to malaria than are normal homozygotes, confer¬ 
ring a selective advantage on the sickle cell mutation in this 
environment. Although there is selection against the muta¬ 
tion in sickle cell homozygotes, there is selection for the 
mutation in heterozygotes. The result is that the disease- 
causing mutation persists at a relatively high frequency in 
many African and Mediterranean populations. In nonma- 
larial environments (e.g., northern Europe), the sickle cell 
mutation has no advantage, so natural selection acts strongly 
against it by eliminating homozygotes. This example illus¬ 
trates the concept that variation in genetic disease incidence 
among populations can be caused by natural selection oper¬ 
ating differentially in different environments. 

As environments change, or as human populations occupy 
new environments, natural selection can act to increase the 
frequencies of variants that permit more successful adapta¬ 
tion. For example, natural selection has acted on several loci 
that affect skin pigmentation as humans moved from lower 
tropical latitudes to higher latitudes. As human populations 
in northern Europe and parts of Africa began to drink 
cow’s milk through adulthood, natural selection increased 
the frequency of hereditary lactase persistence, allowing 
adults to effectively metabolize lactose (in most populations, 
lactase levels decline after weaning). Natural selection has 
acted on components of the hypoxia-inducing factor pathway, 
allowing some human populations to adapt successfully to 
oxygen-deficient high-altitude environments such as the 
Tibetan plateau and the Andean mountains. 

Natural selection is the evolutionary process in 
which alleles that confer survival or reproduc¬ 
tive advantages in a specific environment are 
selected positively to increase in frequency, 
and alleles that confer lower survival or repro¬ 
ductive disadvantages are selected negatively 
so that they decrease in frequency. 


Genetic drift is another force that can cause disease genes 
to vary in frequency among populations. To understand the 
process of genetic drift, consider a coin-tossing exercise in 
which 10 coins are tossed. Because heads and tails are equally 
likely, the expected number of heads and tails in this exercise 
would be 5 each. However, it is intuitively clear that, by 
chance, a substantial departure from this expectation could 
be observed. It would not be surprising to see 7 heads and 3 
tails in 10 throws, for example. However, if 1000 coins are 
tossed, the degree of departure from the expected ratio of 
50% heads and 50% tails is much smaller. A reasonable 
outcome of 1000 throws might be 470 heads and 530 tails, 
but it would be quite unlikely to obtain 700 heads and 300 
tails. Therefore, there is less random fluctuation in larger 
samples. 

The same principle applies to gene frequencies in popula¬ 
tions. In a very small population, a gene frequency can deviate 
substantially from one generation to the next, but this is 
unlikely in a large population. Thus, genetic drift is greater in 
smaller populations. As a result, genetic diseases that are oth¬ 
erwise uncommon may be seen fairly frequently in a small 
population. Eor example, Ellis-van Creveld syndrome, a rare 
disorder that involves reduced stature, polydactyly (extra 
digits), and congenital heart defects, is seen with greatly ele¬ 
vated frequency among the Old Order Amish population of 
Pennsylvania. The Amish population was founded in the 
United States by about 50 couples. Because of this small pop¬ 
ulation size, there was great potential for genetic drift, result¬ 
ing in increased frequencies of certain disease-causing alleles. 

It is common to observe the effect of genetic drift in small, 
isolated populations throughout the world. Even relatively 
large populations might have experienced drift effects in the 
recent past if they underwent severe population bottlenecks 
or were established by a small number of founders (founder 
effect). For example, more than 30 otherwise rare genetic 
diseases are found with elevated frequency in Finland’s popu¬ 
lation, which is thought to have been founded primarily by a 
small number of individuals some 100 generations ago. Phe¬ 
nylketonuria and cystic fibrosis, which are common in other 
Western European populations, are relatively rare in Finland, 
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illustrating the fact that genetic drift can both increase and 
decrease the frequency of disease genes. Several genetic dis¬ 
eases (e.g., torsion dystonia, Tay-Sachs disease, Gaucher 
disease) occur with increased frequency in the Ashkenazi 
Jewish population (see Chapter 7); this may be the result of 
population bottlenecks that have occurred in the history of 
this population. 

Genetic drift is a random evolutionary process 
that produces larger changes in gene frequen¬ 
cies in smaller populations. Founder effect, in 
which small founder populations can experi¬ 
ence large changes in gene frequency because 
of their small size, is a special case of genetic 
drift. 

Gene flow occurs when populations exchange migrants 
who mate with one another. Through time, gene flow between 
populations tends to make them genetically more similar to 
each other. One reason sickle cell disease is less common in 
African Americans than in many African populations is 
because of gene flow between African Americans and Euro¬ 
pean Americans (this same process is likely to have increased 
the frequency of cystic fibrosis in the African American popu¬ 
lation). In addition, because P. falciparum malaria is not 
found in North America, natural selection does not favor the 
sickle cell mutation. 

The forces of mutation, natural selection, genetic drift, 
and gene flow interact in complex and sometimes unexpected 
ways to influence the distribution and prevalence of genetic 
diseases in populations. The interplay of mutation, which 
constantly introduces new variants, and natural selection, 
which often eliminates them, is an important and medically 
relevant example of such an interaction. A simple analysis of 
the relationship between mutation and selection helps us to 
understand variation in gene frequencies. Consider, for 
example, a dominant disease that results in death before the 
person can reproduce. This is termed a genetic lethal muta¬ 
tion because, even though the individual might survive for 
some time, he or she contributes no genes to the next genera¬ 
tion. Each time mutation introduces a new copy of the lethal 
dominant disease allele into a population, natural selection 
eliminates it. In this case, p, the gene frequency of the lethal 
allele in the population, is equal to p, the mutation rate [p = 
p). Now, suppose that those who inherit the allele can survive 
into their reproductive years, but, on average, they produce 
30% fewer children than those who do not inherit the allele. 
This reduction in offspring represents the selection coeffi¬ 
cient, s, of the allele. In this case, s = 0.30. When the allele is 
completely lethal, s = 1 (i.e., no children are produced). We 


can now estimate the gene frequency for this allele as p = p/s. 
As we would expect, the predicted frequency of an allele that 
merely reduces the number of offspring is higher (given the 
mutation rate) than the frequency of an allele that is com¬ 
pletely lethal, where p = p/s = p. This predictable relationship 
between the effects of mutation and selection on gene fre¬ 
quencies is termed mutation-selection balance. 

We can use the same principles to predict the relationship 
between mutation and selection against recessive alleles. The 
Hardy-Weinberg principle shows that most copies of harmful 
recessive alleles are found in heterozygotes and are thus pro¬ 
tected from the effects of natural selection. We would there¬ 
fore expect their gene frequencies to be higher than those of 
harmful dominant alleles that have the same mutation rate. 
Indeed, under mutation-selection balance, the predicted fre¬ 
quency of a recessive allele, q, that is lethal in homozygotes is 
(j = Vp (because p < 1, ^ > p, which results in a relatively 
higher allele frequency for lethal recessive alleles). If the allele 
is not lethal in homozygotes, then q = , where s is again 

the selection coefficient for those who have a homozygous 
affected genotype. Thus, understanding the principle of 
mutation-selection balance helps to explain why, in general, 
the gene frequencies for recessive disease-causing alleles are 
higher than are the frequencies of dominant disease-causing 
alleles. 

Mutation-selection balance predicts a rela¬ 
tively constant gene frequency when new 
mutations introduce harmful alleles, whereas 
natural selection removes them. This process 
predicts that the gene frequencies should be 
lower for dominant diseases, in which most 
alleles are exposed to natural selection, than 
in recessive diseases, where most alleles are 
found in heterozygotes and are thus protected 
from natural selection. 

The techniques described earlier in this chapter (e.g., SNP 
microarrays and high-throughput DNA sequencing) have 
now been used extensively to assay variation among individu¬ 
als and populations. Because these technologies provide 
information about millions of genetic variants in each indi¬ 
vidual, they offer unprecedented opportunities to investigate 
important questions about the origins of modern humans 
and the differences and similarities among individuals and 
populations (see Chapter 14). In addition, they allow inves¬ 
tigators to test hypotheses about the ways in which evolution¬ 
ary processes such as natural selection, gene flow, and genetic 
drift have influenced the distribution of genetic diseases in 
human populations. 




I STUDY QUESTIONS _ 

Normal: Phe-Asn-Pro-Thr-Arg 
Mutation 1: Phe-Asn-Pro 
Mutation 2: Phe-Asn-Ala-His-Thr 
Mutation 3: Phe-His-Pro-Thr-Arg 


1. In the following list, the normal amino acid sequence is 
given first, followed by sequences that are produced by 
different types of mutations. Identify the type of mutation 
most likely to cause each altered amino acid sequence. 
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200 bp 
196 bp 
192 bp 
188 bp 
184 bp 

FIG 3-31 Autoradiogram for study question 4. 




FIG 3-32 Autoradiogram for study question 5. 


2. Missense and transcription (promoter, enhancer, tran¬ 
scription factor) mutations often produce milder disease 
conditions than do frameshift, donor/acceptor site, and 
nonsense mutations. Using the globin genes as examples, 
explain why this is so. 

3. Persons who have mutations that lower their production 
of both a- and [3-globin often present with milder disease 
symptoms than do those who have mutations lowering the 
production of only one type of chain. Why? 

4. Outline the major differences between SNPs, VNTRs, and 
STRs. Which of these three types of polymorphism is rep¬ 
resented in the autoradiogram in Figure 3-31? 

5. tti-Antitrypsin deficiency is a disease that arises when both 
copies of the ai-antitrypsin gene are altered by mutations. 
Liver disease, chronic emphysema, and pulmonary failure 
can result. One of the mutations that causes tti-antitrypsin 
deficiency occurs in exon 3 of the gene and destroys a 
recognition site for the restriction enzyme BstEII. RFLP 


analysis was performed on three members of a family, 
producing the autoradiogram in Figure 3-32. Determine 
the disease status of each individual. 

6. Using protein electrophoresis, 100 members of a popula¬ 
tion were studied to determine whether they carry genes 
for normal hemoglobin (HbA) or sickle hemoglobin 
(HbS). The following genotypes were observed: 

HbA/HbA: 88 
HbA/HbS: 10 
FIbs/Hbs: 2 

What are the gene frequencies of HbA and HbS? What 
are the observed genotype frequencies? Assuming Hardy- 
Weinberg proportions, what are the expected genotype 
frequencies? 

7. Approximately 1 in 10,000 Europeans is born with phe¬ 
nylketonuria. What is the frequency of the disease-causing 
allele? What is the frequency of heterozygous carriers in 
the population? 
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CHAPTER 


Autosomal Dominant and 
Recessive Inheritance 


Many important and well-understood genetic diseases are the 
result of a mutation in a single gene. The online edition of 
McKusick’s Mendelian Inheritance in Man (http://www 
.ncbi.nlm.nih.gOv/Omim/) lists nearly 15,000 single genes 
and nearly 8000 single-gene, or monogenic, traits defined 
thus far in humans. Of these 23,000 genes and traits, nearly 
21,000 are located on autosomes, more than 1,200 are located 
on the X chromosome, and 59 are located on the Y chromo¬ 
some. The identification of genes that cause monogenic traits 
has led to new and exciting insights not only in genetics but 
also in the basic pathophysiology of disease. 

In this chapter we focus on single-gene disorders caused 
by mutations on the autosomes. (Single-gene disorders 
caused by mutations on the sex chromosomes are the subject 
of Chapter 5.) We discuss the patterns of inheritance of these 
diseases in families, as well as factors that complicate these 
patterns. We also discuss the risks of transmitting single-gene 
diseases to one’s offspring, because this is usually an impor¬ 
tant concern for at-risk couples. 

BASIC CONCEPTS OF FORMAL GENETICS 
Gregor Mendel's Contributions 

Monogenic traits are also known as mendelian traits, after 
Gregor Mendel, the 19th-century Austrian monk who 
deduced several important genetic principles from his well- 
designed experiments with garden peas. Mendel studied 
seven traits in the pea, each of which is determined by a single 
gene. These traits included attributes such as height (tall 
versus short plants) and seed shape (smooth versus wrin¬ 
kled). The variation in each of these traits is caused by the 
presence of different alleles at individual loci. 

Several important principles emerged from Mendel’s 
work. The first, the principle of dominant and recessive 
inheritance, was discussed in Chapter 3. Mendel also discov¬ 
ered the principle of segregation, which states that sexually 
reproducing organisms possess genes that occur in pairs and 
that only one member of this pair is transmitted to the off¬ 
spring (i.e., it segregates). The prevalent thinking during 
Mendel’s time was that hereditary factors from the two 
parents are blended in the offspring. In contrast, the principle 
of segregation states that genes remain intact and distinct. An 
allele for “smooth” seed shape can be transmitted to an off¬ 
spring in the next generation, which can, in turn, transmit 
the same allele to its own offspring. If, instead of remaining 
distinct, genes were somehow blended in offspring, it would 
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be impossible to trace genetic inheritance from one genera¬ 
tion to the next. Thus, the principle of segregation was a key 
development in modern genetics. 

Mendel’s principle of independent assortment was 
another significant contribution to genetics. This principle 
states that genes at different loci are transmitted indepen¬ 
dently. Consider the two loci mentioned previously. One 
locus can have either the “smooth” or the “wrinkled” allele, 
and the other can have either the “taU” or the “short” allele. 
In a reproductive event, a parent transmits one allele from 
each locus to its offspring. The principle of independent 
assortment dictates that the transmission of a specific allele 
at one locus (“smooth” or “wrinkled”) has no effect on which 
allele is transmitted at the other locus (“tall” or “short”). 

The principle of segregation describes the behavior of 
chromosomes in meiosis. The genes on chromosomes segre¬ 
gate during meiosis, and they are transmitted as distinct enti¬ 
ties from one generation to the next. When Mendel performed 
his critical experiments, he had no direct knowledge of chro¬ 
mosomes, meiosis, or genes (indeed, the last term was not 
coined until 1909, long after Mendel’s death). Although his 
work was published in 1865 and cited occasionally, its funda¬ 
mental significance was unrecognized for several decades. Yet 
Mendel’s research, which was eventually replicated by other 
researchers at the turn of the 20th century, forms the founda¬ 
tion of much of modern genetics. 

Mendel's key contributions were the principles 
of dominance and recessiveness, segregation, 
and independent assortment. 

The Concept of Phenotype 

The term genotype has been defined as an individual’s 
genetic constitution at a locus. The phenotype is what is 
actually observed physically or clinically. Genotypes do not 
uniquely correspond to phenotypes. Individuals with two 
different genotypes, a dominant homozygote and a heterozy¬ 
gote, can have the same phenotype. An example is cystic 
fibrosis (Clinical Commentary 4-1), an autosomal recessive 
condition in which only the recessive homozygote is affected. 
Conversely, the same genotype can produce different pheno¬ 
types in different environments. An example is the recessive 
disease phenylketonuria (PKU, see Chapter 7), which is seen 
in approximately 1 of every 10,000 European births. Muta¬ 
tions at the locus encoding the metabolic enzyme phenylala¬ 
nine hydroxylase render the homozygote unable to metabolize 
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CLINICAL COMMENTARY 4-1 
Cystic Fibrosis 

Cystic fibrosis (CF) is one of the most common single-gene 
disorders in North America, affecting approximately 1 in 2000 
to 1 in 4000 European American newborns. The prevalence 
among African Americans is about 1 in 15,000 births, and it is 
less than 1 in 30,000 among Asian Americans. Approximately 
30,000 Americans have this disease. 

CF was first identified as a distinct disease entity in 1938 and 
was termed "cystic fibrosis of the pancreas." This refers to 
the fibrotic lesions that develop in the pancreas, one of the 
principal organs affected by this disorder (Fig. 4-1). Approxi¬ 
mately 85% of CF patients have pancreatic insufficiency, in 
which the pancreas is unable to secrete digestive enzymes, 
contributing to chronic malabsorption of nutrients. The intesti¬ 
nal tract is also affected, and approximately 15% to 20% of 
newborns with CF have meconium ileus (thickened, obstruc¬ 
tive intestinal matter). The sweat glands of CF patients are 
abnormal, resulting in high levels of chloride in the sweat. This 
is the basis for the sweat chloride test, commonly used in the 
diagnosis of this disease. More than 95% of males with CF are 
sterile due to absence or obstruction of the vas deferens. 

The major cause of morbidity and mortality in CF patients is 
pulmonary disease. Patients with CF have lower airway inflam¬ 
mation and chronic bronchial infection, progressing to end- 
stage lung disease characterized by extensive airway damage 
and fibrosis of lung tissue. Airway obstruction and lung injury 
are thought to be caused by a dehydrated airway surface and 


reduced clearance, resulting in thick airway mucus. This is 
associated with infection by bacteria such as Staphylococcus 
aureus and Pseudomonas aeruginosa. The combination of 
airway obstruction, inflammation, and infection leads to 
destruction of the airways and lung tissue, resulting eventually 
in death from pulmonary disease in more than 90% of CF 
patients. 

As a result of improved nutrition, airway clearance tech¬ 
niques, and antibiotic therapies, the survival rate of CF patients 
has improved substantially during the past 3 decades. Median 
survival time is now nearly 40 years. CF has highly variable 
expression, with some patients experiencing only mild respira¬ 
tory difficulty and nearly normal survival. Others have much 
more severe respiratory problems and may survive less than 2 
decades. 

CF is caused by mutations in a gene, CFTR* that encodes 
the cystic fibrosis transmembrane conductance regulator (Fig. 
4-2). CFTR encodes cyclic AMP-regulated chloride ion channels 
that span the membranes of specialized epithelial cells, such 
as those that line the bowel and lung. In addition, CFTR is 
involved in regulating the transport of sodium ions across epi¬ 
thelial cell membranes. The role of CFTR in sodium and chlo¬ 
ride transport helps us to understand the multiple effects of 
mutations at the CF locus. Defective ion transport results in 
salt imbalances, depleting the airway of water and producing 
the thick, obstructive secretions seen in the lungs. The 



FIG 4-1 A, Normal pancreas. B, Pancreas from a cystic fibrosis patient, showing infiltration of 
fat and fibrotic lesions. C, Normal lung tissue. D, Lung tissue from a cystic fibrosis patient, which 
shows extensive destruction as a result of obstruction and infection. (Courtesy Dr. Edward Klatt, 
Florida State University School of Medicine.) 


Continued 
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CLINICAL COMMENTARY 4-1 
Cystic Fibrosis—cont'd 


FIG 4-2 Classes of mutations in the CFTR gene and their effects in 
cells. Class I mutations result in no synthesis of the gene product. Class 
II mutations produce a defective protein product that is destroyed in 
proteasomes. Class III mutations produce a protein that gets to the cell 
surface but is abnormally regulated. Class IV mutations result in defec¬ 
tive chloride ion conductance. Class V mutations are typically promoter 
or intron-exon splicing mutations that reduce the number of mRNA 
transcripts, allowing some normal protein products. Class VI mutations 
result in increased rates of turnover of the chloride channel at the cell 
surface. 
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pancreas is also obstructed by thick secretions, leading to 
fibrosis and pancreatic insufficiency. The chloride ion transport 
defect explains the abnormally high concentration of chloride 
in the sweat secretions of CF patients: chloride cannot be 
reabsorbed from the lumen of the sweat duct. 

DNA sequence analysis has revealed nearly 2,000 different 
mutations at the CFTR locus. The most common of these, 
labeled F508del, is a three-base deletion that results in the loss 
of a phenylalanine residue (F) at position 508 of the CFTR 
protein. F508del accounts for nearly 70% of all CF mutations. 
This mutation, along with several dozen other relatively 
common ones, is assayed in the genetic diagnosis of CF (see 
Chapter 13). 

Identification of the specific mutation or mutations that are 
responsible for CF in a patient can help to predict the severity 
of the disease. For example, the most severe classes of muta¬ 
tions (of which F508del is an example) result in a complete lack 
of chloride ion channel production or in channels that cannot 
migrate to the cell membrane. Patients homozygous for these 
mutations nearly always have pancreatic insufficiency. In con¬ 
trast, other mutations (e.g., RUTH, a missense mutation) 
result in ion channels that do proceed to the cell membrane 
but respond poorly to cyclic AMP and consequently do not 
remain open as long as they should. The phenotype is thus 
milder, and patients who have this mutation are less likely to 
have pancreatic insufficiency. Some males with mild CFTR 
mutations have only congenital bilateral absence of the vas 
deferens (CBAVD) but little, if any, lung or gastrointestinal 
disease. The correlation between genotype and phenotype is 


far from perfect, however, indicating that modifier loci and 
environmental factors must also influence expression of the 
disease (see text). In general, there is a reasonably good cor¬ 
relation between genotype and pancreatic function and a more 
variable relationship between genotype and pulmonary 
function. 

The ability to identify CFTR mutations has led to surveys of 
persons who have one (heterozygous) or two (homozygous) 
CFTR mutations but who do not have cystic fibrosis. They have 
increased risks for a number of disease conditions, including 
CBAVD, bronchiectasis (chronic dilatation of the bronchi and 
abnormal mucus production), and pancreatitis (pancreatic 
inflammation). 

By enhancing our understanding of the pathophysiology of CF, 
identification of CFTR has opened the possibility of new treat¬ 
ments for this disease. Examples include administration of 
drugs that cause ribosomes to read through the premature stop 
codons that account for approximately 7% of CFTR mutations. 
Other drugs can increase the activity of chloride channels in 
patients with class III or IV mutations. The first FDA-approved 
drug for CF treatment, ivacaftor, increases CFTR channel activity 
in response to ATP. Clinical trials have demonstrated that this 
drug significantly improves lung function in patients with the 
class III G551D mutation seen in approximately 4% of CF 
patients. Gene therapy, in which the normal CFTR gene is placed 
in viral or other vectors that are then introduced to the patient's 
airway (see Chapter 1 3), is also being actively investigated. This 
strategy, however, has encountered difficulties because viral 
vectors often induce an inflammatory immune response. 


Conventionally, the symbol for a gene, such as CFTR, is shown in italics, and the symbol for the protein product is not. 
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the amino acid phenylalanine. Although habies with PKU are 
unaffected at birth, their metabolic deficiency produces a 
buildup of phenylalanine and its toxic metabolites. This 
process is highly destructive to the central nervous system, 
and it eventually produces severe mental impairment. It has 
been estimated that babies with untreated PKU lose, on 
average, 1 to 2 IQ points per week during the first year of life. 
Thus, the PKU genotype can produce a severe disease pheno¬ 
type. However, it is straightforward to screen for PKU at 
birth (see Chapter 13), and damage to the brain can be 
avoided by initiating a low-phenylalanine diet within 1 
month after birth. The child stUl has the PKU genotype, but 
the phenotype has been profoundly altered by environmental 
modification. 

This example shows that the phenotype is the result of the 
interaction of genotype and environmental factors. It should 
be emphasized that “environment” can include the genetic 
environment (i.e., genes at other loci whose products can 
interact with a specific gene or its product). 



The phenotype, which is physically observ¬ 
able, results from the interaction of genotype 
and environment. 


Basic Pedigree Structure 

The pedigree is one of the most commonly used tools in 
medical genetics. It illustrates the relationships among family 
members, and it shows which family members are affected or 
unaffected by a genetic disease. Typically, an arrow denotes 
the proband, the first person in whom the disease is diag¬ 
nosed in the pedigree. The proband is sometimes also referred 
to as the index case or propositus (proposita for a female). 
Figure 4-3 describes the features of pedigree notation. 

When discussing relatives in families, one often refers to 
degrees of relationship. First-degree relatives are those who 
are related at the parent-offspring or sibling (brother and 
sister) level. Second-degree relatives are those who are 
removed by one additional generational step (e.g., grandpar¬ 
ents and their grandchildren, uncles or aunts and their nieces 
or nephews). Continuing this logic, third-degree relatives 
would include, for example, one’s first cousins and 
great-grandchildren. 


AUTOSOMAL DOMINANT INHERITANCE 

Characteristics of Autosomal 
Dominant Inheritance 

Autosomal dominant diseases are seen in roughly 1 of every 
200 individuals (see Table 1-3 in Chapter 1). Individually, 
each autosomal dominant disease is rather rare in popula¬ 
tions, however, with the most common ones having gene 
frequencies of about 0.001. For this reason, matings between 
two individuals who are both affected by the same autosomal 
dominant disease are uncommon. Most often, affected off¬ 
spring are produced by the union of an unaffected parent 
with an affected heterozygote. The Punnett square in Figure 
4-4 illustrates such a mating. The affected parent can pass 
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Normal female 
Normal male 
Sex not specified 


Single bar indicates mating 


Normal parents and normal offspring, 
two girls and a boy, in birth order 
indicated by the numbers; I and II 
indicate generations 


Single parent as presented means 
partner is normal or of no significance to 
the analysis 


Double bar indicates a consanguineous 
mating (mating between close relatives) 


Fraternal twins (not identical) 


Identical twins 


© and I© Multiple individuals of each sex 

and Darkened square or circle means 

affected individual; arrow (when present) 
indicates the affected individual is 
the propositus (proband) 


0 


Carrier—not likely to manifest disease 



Dead 


Stillbirth at 29 weeks gestation 
SB 

29 wk 

FIG 4-3 Basic pedigree notation. (For further details, see 
Bennett RL, French KS, Resta RG, Doyle DL. Standardized 
human pedigree nomenclature: update and assessment of 
the recommendations of the National Society of Genetic 
Counselors. J Genet Couns. 2008:17:424-433.) 


either a disease allele or a normal allele to his or her children. 
Each event has a probability of 0.5. Thus, on the average, half 
of the children will be heterozygotes and will express the 
disease, and half will be unaffected homozygotes. 

Postaxial polydactyly, the presence of an extra digit next 
to the fifth digit (Fig. 4-5), can be inherited as an autosomal 
dominant trait. An idealized pedigree for this disease, shown 
in Figure 4-6, illustrates several important characteristics of 
autosomal dominant inheritance. First, the two sexes exhibit 
the trait in approximately equal ratios, and males and females 
are equally likely to transmit the trait to their offspring. This 
is because postaxial polydactyly is an autosomal disease (as 
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FIG 4-4 Punnett square illustrating the mating of an unaf¬ 
fected individual (aa) with an individual who is heterozygous 
for an autosomal dominant disease gene (Aa). The genotypes 
of affected offspring are shaded. 




aa aa aa Aa aa Aa 

FIG 4-6 Pedigree illustrating the inheritance pattern of post- 
axial Polydactyly, an autosomal dominant disorder. Affected 
individuals are represented by shading. 


opposed to a disease caused by an X chromosome mutation, 
in which these ratios typically differ). Second, there is no 
skipping of generations: if an individual has polydactyly, one 
parent must also have it. This leads to a vertical transmission 
pattern, in which the disease phenotype is usually seen in one 


generation after another. If neither parent has the trait, none 
of the children will have it. Third, father-to-son transmission 
of the disease gene is observed. Although father-to-son trans¬ 
mission is not required to establish autosomal dominant 
inheritance, its presence in a pedigree rules out some other 
modes of inheritance (particularly X-linked inheritance; see 
Chapter 5). Finally, as we have already seen, an affected het¬ 
erozygote transmits the disease-causing allele to approxi¬ 
mately half of his or her children. However, because gamete 
transmission, like coin tossing, is subject to chance fluctua¬ 
tions, it is possible that all or none of the children of an 
affected parent will have the trait. When large numbers of 
matings of this type are studied, the ratio of affected children 
closely approximates 1/2. 



Autosomal dominant inheritance is character¬ 
ized by vertical transmission of the disease 
phenotype, a lack of skipped generations, and 
roughly equal numbers of affected males and 
females. Father-to-son transmission may be 
observed. 


Recurrence Risks 

Parents at risk for producing children with a genetic disease 
typically want to know the risk, or probability, that their 
future children will be affected. This probability is termed the 
recurrence risk. If one parent is affected by an autosomal 
dominant disease and the other is unaffected, the recurrence 
risk for each child is X (assuming that the affected parent is 
a heterozygote, which is nearly always the case). It is impor¬ 
tant to keep in mind that each birth is an independent event, 
as in the coin-tossing examples. Thus, even if the parents have 
already had a child with the disease, their recurrence risk 
remains 1/2. Even if they have had several children, all affected 
(or all unaffected) by the disease, the law of independence 
dictates that the probability that their next child will have the 
disease is still 1/2. Although this concept seems intuitively 
obvious, it is commonly misunderstood by the lay popula¬ 
tion. Further aspects of communicating risks to families are 
discussed in Chapter 15. 



The recurrence risk for an autosomal dominant 
disorder is 50%. Because of independence, this 
risk remains constant no matter how many 
affected or unaffected children are born. 


AUTOSOMAL RECESSIVE INHERITANCE 

Like autosomal dominant diseases, autosomal recessive dis¬ 
eases are fairly rare in populations. As shown previously, het¬ 
erozygous carriers for recessive disease alleles are much more 
common than affected homozygotes. Consequently, the 
parents of individuals affected with autosomal recessive dis¬ 
eases are usually both heterozygous carriers. As the Punnett 
square in Figure 4-7 demonstrates, one fourth of the off¬ 
spring of two heterozygotes will be unaffected homozygotes, 
half will be phenotypically unaffected heterozygous carriers, 
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Carrier parent 



FIG 4-7 Punnett square illustrating the mating of two hetero¬ 
zygous carriers of an autosomal recessive gene. The geno¬ 
type of the affected offspring is shaded. 


and one fourth will be homozygotes affected with the disease 
(on average). 

Characteristics of Autosomal 
Recessive Inheritance 

Figure 4-8 is a pedigree showing the inheritance pattern of 
an autosomal recessive form of albinism that results from 
mutations in the gene that encodes tyrosinase, a tyrosine- 
metabolizing enzyme.* The resulting tyrosinase deficiency 
creates a block in the metabolic pathway that normally leads 
to the synthesis of melanin pigment. Consequently, the 
affected individual has very little pigment in the skin, hair, 
and eyes (Fig. 4-9). Because melanin is also required for the 
normal development of the optic fibers, persons with albi¬ 
nism can also display nystagmus (rapid uncontrolled eye 
movement), strabismus (deviation of the eye from its normal 
axis), and reduced visual acuity. The pedigree demonstrates 
most of the important criteria for distinguishing autosomal 
recessive inheritance (Table 4-1). First, unlike autosomal 
dominant diseases, in which the disease phenotype is seen in 
one generation after another, autosomal recessive diseases are 
usually observed in one or more siblings but not in earlier 
generations. Second, as in autosomal dominant inheritance, 
males and females are affected in equal proportions. Third, 
on average, one fourth of the offspring of two heterozygous 
carriers will be affected with the disorder. Finally, consan¬ 
guinity is present more often in pedigrees involving autoso¬ 
mal recessive diseases than in those involving other types of 
inheritance (see Fig. 4-8). The term consanguinity (Latin, 


*This form of albinism, termed tyrosinase-negative oculocutaneous 
albinism (OCAl), is distinguished from a second, mUder form 
termed tyrosinase-positive oculocutaneous albinism (OCA2). OCA2 
is typically caused by mutations in a gene on chromosome 15 whose 
protein product is thought to be involved in the transport and pro¬ 
cessing of tyrosinase. 



aa Aa aa AA 


FIG 4-8 Pedigree showing the inheritance pattern of 
tyrosinase-negative albinism, an autosomal recessive disease. 
Consanguinity in this pedigree is denoted by a double bar 
connecting the parents of the affected individuals. 



FIG 4-9 An African woman with oculocutaneous albinism, 
illustrating a lack of pigmentation in the hair and skin. She is 
looking away from the camera because her eyes are more 
sensitive to light than are those of persons with normally 
pigmented retinas. (Courtesy Dr. Phil Fischer, Mayo Clinic.) 
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TABLE 4-1 A Comparison of the Major 
Attributes of Autosomal Dominant and 
Autosomal Recessive Inheritance Patterns 


AUTOSOMAL 

AUTOSOMAL 

ATTRIBUTE 

DOMINANT 

RECESSIVE 

Usual recurrence 
risk 

50% 

25% 

Transmission 

Vertical; disease 

Disease phenotype 

pattern 

phenotype seen 

may be seen in 


in generation 

multiple siblings, 


after generation 

but usually not in 
earlier generations 

Sex ratio 

Equal number of 

Equal number of 


affected males 

affected males 


and females 

and females 


(usually) 

(usually) 

Other 

Father-to-son 

Consanguinity is 


transmission of 

sometimes seen, 


disease gene is 

especially for rare 


possible 

recessive 

diseases 


“with blood”) refers to the mating of related persons. It is 
sometimes a factor in recessive disease because related persons 
are more likely to share the same disease-causing alleles. Con¬ 
sanguinity is discussed in greater detail later in this chapter. 

Autosomal recessive inheritance is character¬ 
ized by clustering of the disease phenotype 
among siblings, but the disease is not usually 
seen among parents or other ancestors. Equal 
numbers of affected males and females 
are usually seen, and consanguinity may be 
present. 

Recurrence Risks 

As already discussed, the most common mating type seen in 
recessive disease involves two heterozygous carrier parents. 
This reflects the relative commonness of heterozygous carri¬ 
ers and the fact that many autosomal recessive diseases are 
severe enough that affected individuals are less likely to 
become parents. 

The Punnett square in Figure 4-7 demonstrates that one 
fourth of the offspring from this mating will be homozygous 
for the disease gene and therefore affected. The recurrence 
risk for the offspring of carrier parents is then 25%. As before, 
these are average figures. In any given family chance fluctua¬ 
tions are likely, but a study of a large number of families 
would yield a figure quite close to this fraction. 

Occasionally, a carrier of a recessive disease-causing allele 
mates with a person who is homozygous for this allele. In this 
case, roughly half of their children will be affected, and half 
will be heterozygous carriers. The recurrence risk is 50%. 
Because this pattern of inheritance mimics that of an auto¬ 
somal dominant trait, it is sometimes referred to as quasi¬ 
dominant inheritance. With studies of extended pedigrees in 



FIG 4-10 Achondroplasia. This girl has short limbs relative to 
trunk length. She also has a prominent forehead, low nasal 
root, and redundant skinfolds in the arms and legs. 

which matings of heterozygotes are observed, quasidominant 
inheritance can be distinguished from true dominant 
inheritance. 

When two persons affected by a recessive disease mate, all 
of their children must also be affected. This observation helps 
to distinguish recessive from dominant inheritance because 
two parents who are both affected by a dominant disease are 
almost always both heterozygotes and one fourth of their 
children, on average, will be unaffected. 

The recurrence risk for autosomal recessive 
diseases is usually 25%. Quasidominant inher¬ 
itance, with a recurrence risk of 50%, is seen 
when an affected homozygote mates with a 
heterozygote. 

"Dominant" Versus "Recessive"; Some Cautions 

The preceding discussion has treated dominant and recessive 
disorders as though they belong in rigid categories. However, 
these distinctions are becoming less strict as our understand¬ 
ing of these diseases increases. Most dominant diseases are 
actually more severe in affected homozygotes than in hetero¬ 
zygotes. An example is achondroplasia, an autosomal domi¬ 
nant disorder in which heterozygotes have reduced stature 
(Fig. 4-10). Heterozygotes have a nearly normal life span, 
estimated to be about 10 years less than average. Affected 
homozygotes are much more severely affected and usually die 
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in infancy from respiratory failure (see Chapter 10 for further 
discussion of achondroplasia). 

Although heterozygous carriers of recessive disease genes 
are clinically normal, the effects of recessive genes can often 
be detected in heterozygotes because they result, for example, 
in reduced levels of enzyme activity. This is usually the basis 
for biochemical carrier detection tests (see Chapter 13). A 
useful and valid way to distinguish dominant and recessive 
disorders is that heterozygotes are clinically affected in most 
cases of dominant disorders, whereas they are almost always 
clinically unaffected in recessive disorders. 

Although the distinction between dominant 
and recessive diseases is not rigid, a dominant 
disease allele will produce disease in a het¬ 
erozygote, whereas a recessive disease allele 
will not. 

Another caution is that a disease may be inherited in auto¬ 
somal dominant fashion in some cases and in autosomal 
recessive fashion in others. Familial isolated growth hormone 
deficiency (IGHD), another disorder that causes reduced 
stature, is one such disease. DNA sequencing of a pituitary 
growth hormone gene on chromosome 17 (GHl) has revealed 
a number of different mutations that can produce IGHD. 
Recessive IGFiD can be caused by nonsense, frameshift, or 
splice-site mutations that have a loss-of-function effect in 
which a mature protein product is not synthesized. Because 
they have one normal copy of GHl, heterozygotes still 
produce half of the normal amount of growth hormone. This 
is sufficient for normal stature. Fiomozygotes for these muta¬ 
tions produce no GH1 product and have reduced stature. 

Dominant IGHD can result from dominant negative 
mutations (see Chapter 3) in GHl. In one form of dominantly 
inherited IGHD, a splice-site mutation deletes the third exon 
of GHl, producing an abnormal protein that proceeds to the 
secretory granules. Here, the abnormal GHl product interacts 
with the normal protein encoded by the normal GHl allele. 
The abnormal molecules disable the normal growth hormone 
molecules, resulting in greatly reduced production of GHl 
product and thus reduced stature. 

Another example is given by [J-thalassemia, a condition 
discussed in Chapter 3. Although the great majority of 
(F-thalassemia cases occur as a result of autosomal recessive 
alleles, a small fraction of cases are inherited in autosomal 
dominant fashion. Some of these are caused by nonsense or 
frameshift mutations that terminate translation in exon 3 of 
the P-globin gene or in downstream exons. The resulting 
messenger RNA (mRNA) proceeds to the cytoplasm and pro¬ 
duces unstable [F-globin chains. In heterozygotes, these 
abnormal chains exert a dominant negative effect on the 
normal (J-globin chains produced by the normal allele. In 
contrast, frameshift or nonsense mutations that result in ter¬ 
mination of translation in exons 1 or 2 of the gene result in 
very little abnormal mRNA in the cytoplasm, leaving the 
product of the normal allele intact. In these cases, the hetero¬ 
zygote is unaffected. 


These examples illustrate some of the complexities 
involved in applying the terms “dominant” and “recessive.” 
They also show how molecular analysis of a gene can help to 
explain important disease features. 

In some cases, a disease may be inherited 
in either autosomal dominant or autosomal 
recessive fashion, depending on the nature of 
the mutation that alters the gene product. 

A final caution is that the terms dominant and recessive, 
strictly speaking, apply to traits, not genes. To see why, 
consider the sickle cell mutation, discussed in Chapter 3. 
Homozygotes for this mutation develop sickle cell disease. 
Heterozygotes, who are said to have sickle cell trait, are usually 
clinically normal. However, a heterozygote has an increased 
risk for splenic infarctions at very high altitude. Is the mutant 
gene then dominant or recessive? Clearly, it makes more 
sense to refer to sickle cell disease as recessive and sickle 
cell trait as dominant. Nonetheless, it is common (and often 
convenient) to apply the terms dominant and recessive to 
genes. 

FACTORS THAT AFFECT EXPRESSION OF 
DISEASE-CAUSING GENES 

The inheritance patterns described previously for conditions 
like postaxial polydactyly, cystic fibrosis, and albinism are 
quite straightforward. However, most genetic diseases vary in 
their degree of expression, and sometimes a person has a 
disease-causing genotype but never manifests the phenotype. 
Genetic diseases are sometimes seen in the absence of any 
previous family history. These phenomena, and the factors 
responsible for them, are discussed next. 

New Mutation 

If a child has been born with a genetic disease that has not 
occurred previously in the family, it is possible that the disease 
is the product of a new (or de novo) mutation. That is, the 
gene transmitted by one of the parents underwent a change 
in DNA sequence, resulting in a mutation from a normal to 
a disease-causing allele. The alleles at this locus in the parent’s 
other germ cells would still be normal. In this case the recur¬ 
rence risk for the parents’ subsequent offspring would not be 
elevated above that of the general population. However, the 
offspring of the affected child might have a substantially 
elevated risk (e.g., it would be 50% for an autosomal domi¬ 
nant disease). A large fraction of the observed cases of many 
autosomal dominant diseases are the result of new mutations. 
For example, approximately 7/8 of all cases of achondroplasia 
are caused by new mutations, and only 1/8 are inherited from 
an affected parent. This is primarily because the disease tends 
to limit the potential for reproduction. To provide accurate 
risk estimates, it is essential to know whether a patient’s 
disease is due to an inherited mutation or a new mutation. 
This can be done only if an adequate family history has been 
taken. 
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New mutations are a common cause of the 
appearance of a genetic disease in a person 
with no previous family history of the disorder. 
The recurrence risk for the person's siblings is 
very low, but the recurrence risk for the per¬ 
son's offspring may be substantially increased. 

Germline Mosaicism 

Occasionally, two or more offspring present with an autoso¬ 
mal dominant or X-linked disease when there is no family 
history of the disease. Because mutation is a rare event, it is 
unlikely that this would be due to multiple new mutations in 
the same family. The mechanism most likely to be responsible 
is termed germline mosaicism (mosaicism describes the 
presence of more than one genetically distinct cell line in 
the body). During the embryonic development of one of the 
parents, a mutation occurred that affected all or part of the 
germline but few or none of the somatic cells of the embryo 
(Fig. 4-11). Thus, the parent carries the mutation in his or 
her germline but does not actually express the disease because 


Embryo 


* -Mutation 



cells 

* 

FIG 4-11 A mutation occurs in one cell of the developing 
embryo. All descendants of that cell have the same mutation, 
resulting in mosaicism. If the first mutated cell is part of the 
germline lineage, then germline mosaicism results. 


the mutation is absent in other cells of the body. As a result, 
the parent can transmit the mutation to multiple offspring. 
Although this phenomenon is relatively rare, it can have sig¬ 
nificant effects on recurrence risks when it does occur. 

Germline mosaicism has been studied extensively in the 
lethal perinatal form of osteogenesis imperfecta (01 type 11; 
see Chapter 2), which is caused by mutations in the type 1 
procollagen genes. The fact that unaffected parents some¬ 
times produced multiple offspring affected with this disease 
led to the conclusion that type 11 01 was an autosomal reces¬ 
sive trait. This was disputed by studies in which the poly¬ 
merase chain reaction (PCR) technique was used to amplify 
DNA from the sperm of a father of two children with type 11 
OI. This DNA was compared with DNA extracted from his 
somatic cells (skin fibroblasts). Although procollagen muta¬ 
tions were not detected in the fibroblast DNA, they were 
found in approximately one of every eight sperm cells. This 
was a direct demonstration of germline mosaicism in this 
man. Although germline mosaicism has been demonstrated 
for type II OI, most noninherited cases (approximately 95%) 
are thought to be caused by new mutations. Some cases of 
true autosomal recessive inheritance have also been docu¬ 
mented, and genes that can cause autosomal recessive OI have 
been identified (Clinical Commentary 2-1, Chapter 2). 

Other diseases in which germline mosaicism has been 
observed include achondroplasia, neurofibromatosis type 1, 
Duchenne muscular dystrophy, and hemophilia A (the latter 
two diseases are discussed in Chapter 5). It has been estimated 
that germline mosaicism accounts for up to 15% of Duch¬ 
enne muscular dystrophy cases and 20% of hemophilia A 
cases in which there is no previous family history. 

Germline mosaicism occurs when all or part of 
a parent's germline is affected by a disease- 
causing mutation but the somatic cells are not. 
It elevates the recurrence risk for offspring of 
the mosaic parent. 

Reduced Penetrance 

Another important characteristic of many genetic diseases is 
reduced (or incomplete) penetrance: a person who has a 
disease-causing genotype might not exhibit the disease phe¬ 
notype at all, even though he or she can transmit the disease- 
causing allele to the next generation. Retinoblastoma, a 
malignant eye tumor (Clinical Commentary 4-2), is a good 
example of an autosomal dominant disorder in which reduced 
penetrance is seen. The transmission pattern of this disorder 
is illustrated in Figure 4-12. Family studies have shown that 
about 10% of the obligate carriers of a retinoblastoma- 
causing allele (i.e., those who have an affected parent and 
affected children and therefore must themselves carry the 
allele) do not have the disease. The penetrance of the disease- 
causing genotype is then said to be 90%. Penetrance rates are 
usually estimated by examining a large number of families 
and determining what percentage of the obligate carriers (or 
obligate homozygotes, in the case of recessive disorders) 
develop the disease phenotype. 
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Aa aa Aa aa Aa Aa 


FIG 4-12 Pedigree illustrating the inheritance pattern of reti¬ 
noblastoma, a disorder with reduced penetrance. The unaf¬ 
fected obligate carrier, denoted by a dot, has the same 
genotype as the affected pedigree members. 


Reduced penetrance describes the situation 
in which persons who have a disease- 
causing genotype do not develop the disease 
phenotype. 

Age-Dependent Penetrance 

Although some genetic diseases are expressed at birth or 
shortly afterward, many others do not become apparent until 
well into adulthood. A delay in the age of onset of a genetic 
disease is known as age-dependent penetrance. One of the 
best-known examples is Huntington disease, a neurological 
disorder whose main features are progressive dementia and 
increasingly uncontrollable movements of the limbs (Clinical 
Commentary 4-3). The latter feature is known as chorea 
(from the Greek word for “dance,” khoreia), and the disease 
is sometimes called Huntington chorea. This autosomal 
dominant disorder is named after Dr. George Huntington, 
who first described the disease in 1872. Symptoms are not 
usually seen until age 30 years or later (Fig. 4-13). Thus, those 
who develop the disease often have children before they are 
aware that they carry the disease-causing allele. If the disease 
were present at birth, nearly all affected persons would die 
before reaching reproductive age, and the frequency of the 
disease in the population would be much lower. Delaying the 
age of onset of the disease thus reduces natural selection 
against a disease-causing allele, increasing its frequency in a 
population. 

A person whose parent has Huntington disease has a 50% 
chance of inheriting the disease allele. Until recently, this 
person would be confronted with a torturous question: 
Should 1 have children, knowing that there is a 50% chance 
that 1 have inherited this mutation and pass it to half of my 
children? With the identification of the gene responsible for 
Huntington disease, it is now possible for at-risk persons to 



Age of onset 

FIG 4-13 Distribution of the age of onset for Huntington 
disease. The age of onset tends to be somewhat earlier 
when the affected parent is the father. (Data from Conneally 
PM. Huntington disease: genetics and epidemiology. Am J 
Hum Genet. 1984:36:520.) 

know with a high degree of certainty whether they carry a 
disease-causing allele. 

As mentioned previously, a number of important genetic 
diseases exhibit age-dependent penetrance. These include 
hemochromatosis, a recessive disorder of iron storage (see 
Chapter 7); familial Alzheimer disease (see Chapter 12); and 
many inherited cancers, including autosomal dominant 
breast cancer. ' 

Age-dependent penetrance is observed in 
many genetic diseases. It complicates the inter¬ 
pretation of inheritance patterns in families. 

Variable Expression 

Penetrance and expression are distinct entities. Penetrance is 
an all-or-none phenomenon: one either has the disease phe¬ 
notype or does not. Variable expression refers to the degree 
of severity of the disease phenotype. 

The severity of expression of many genetic diseases can 
vary greatly. A well-studied example of variable expression 
in an autosomal dominant disease is neurofibromatosis type 
1 (Clinical Commentary 4-4). A parent with mild expression 
of the disease—so mild that he or she is not aware of it— 
can transmit the disease-causing allele to a child, who may 
have severe expression. Many factors can affect the expres¬ 
sion of a genetic disease, including environmental (i.e., non- 
genetic) influences such as diet, exercise, or exposure to 
harmful agents such as tobacco smoke. In the absence of a 
certain environmental factor, the disease-causing gene is 
expressed with diminished severity or not at all (e.g., the 
diminished expression of PKU under a low-phenylalanine 
diet). Another factor that can influence disease expression is 
the interaction of other genes, called modifier genes, with 


^Epidemiological studies indicate that about 5% of breast cancer 
cases in the United States are caused by genes inherited in autosomal 
dominant fashion. See Chapters 11 and 12 for further discussion. 
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CLINICAL COMMENTARY 4-2 
Retinoblastoma 

Retinoblastoma is the most common childhood eye tumor, 
affecting approximately 1 in 20,000 children. The tumor typi¬ 
cally initiates between 3 months after conception and 4 years 
of age, when retinal cells are actively dividing and proliferating. 
It nearly always presents clinically by the age of 5 years. 

Approximately 60% of retinoblastoma cases are caused 
by somatic mutations that occur in early development and 
are therefore not transmitted to the affected individual's 
offspring. The remaining 40% are caused by inherited muta¬ 
tions: About 3/4 of these (30% of total cases) are the result 
of new mutations, most often transmitted by the father. The 
other 1/4 of inherited cases (10% of the total) are inherited 
from a parent who carries a retinoblastoma-causing muta¬ 
tion in all of his or her cells. About 10% of those who have 
inherited a disease-causing mutation never develop a tumor 
(reduced penetrance). 

The analysis of changes in DNA in and near the disease- 
causing gene, RBI, has explained the mechanism responsible 
for reduced penetrance. Briefly, an individual who has inherited 
a disease-causing RBI mutation carries the mutation in every 
cell of his or her body. However, this is not sufficient to cause 
tumor formation (if it were, every cell in the body would give 
rise to a tumor). In any cell, the presence of one normal RBI 
allele is sufficient to prevent tumor formation. To initiate a 
tumor in a developing retinal cell, a second somatic event must 
occur that disables the other, normal RBI allele (this "two-hit" 
process is discussed further in Chapter 1 1). The second event, 
which can be considered a somatic mutation, has a relatively 
low probability of occurring in any given cell. However, there 
are at least 1 million retinal cells in the developing fetus, each 
representing a potential target for the event. Usually, an indi¬ 
vidual who has inherited one disease-causing mutation will 
experience a second somatic mutation in several different 
retinal cells, giving rise to several tumors. Inherited retinoblas¬ 
toma is thus usually multifocal (consisting of several tumor foci) 
and bilateral (affecting both eyes). Because the second muta¬ 
tions are random events, a small fraction of persons who 
inherit the disease-causing allele never experience a second 
mutation in any retinal cell and they do not develop a retino¬ 
blastoma. The requirement for a second somatic mutation thus 
explains the reduced penetrance seen in this disorder. 

The retinoblastoma gene, RBI, encodes a protein product, 
pRb, that plays a key role in cell cycle control. A major function 
of pRb, when hypophosphorylated, is to bind and inactivate 
members of the E2F family of nuclear transcription factors. The 
cell requires active E2F to proceed from the G1 to the S phase 
of mitosis. By inactivating E2F, pRb applies a brake to the ceil 
cycle. When cell division is required, pRb is phosphorylated by 
cyclin-dependent kinase complexes (see Chapter 2) and E2F is 
released by pRb and activated. A loss-of-function mutation in 
pRb can cause a permanent loss of E2F-binding capacity. The 


cell, having lost its brake, will undergo repeated, uncontrolled 
mitosis, potentially leading to a tumor. Because of its control¬ 
ling effect on the cell cycle, the RBI gene belongs to a class 
of genes known as tumor suppressors (see Chapter 11). 

If untreated, retinoblastomas can grow to considerable size 
and can metastasize to the central nervous system or other 
organ systems. Fortunately, these tumors are now usually 
detected and treated before they progress. If found early 
enough through ophthalmological examination, the tumor may 
be treated successfully with cryotherapy (freezing) or laser 
photocoagulation. In more advanced cases, radiation, chemo¬ 
therapy, or enucleation (removal) of the eye may be necessary. 
Currently, the 5-year survival rate for retinoblastoma patients 
in the United States is nearly 95%. 

Because persons with familial retinoblastoma have inherited 
an RBI mutation in all cells of their body, they are also suscep¬ 
tible to other types of cancer later in life. In particular, about 
15% of those who inherit an RBI mutation later develop osteo¬ 
sarcomas (malignant bone tumors). Other common second 
cancers include soft tissue sarcomas and cutaneous melano¬ 
mas. Careful monitoring for subsequent tumors and avoidance 
of agents that could produce a second mutation (e.g., x-rays) 
are thus important aspects of management for the patient with 
inherited retinoblastoma (Fig. 4-14). 



FIG 4-14 A, A white reflex (leukocoria) can be seen in the 
right eye of this individual on ophthalmoscopic examination. 
B, Bilateral retinoblastoma, showing presence of neoplastic 
tissue. (From Rosai J. Ackerman's Surgical Pathology. 8th 
ed. St Louis: Mosby; 1996.) 
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CLINICAL COMMENTARY 4-3 
Huntington Disease 

Huntington disease (HD) affects approximately 1 in 20,000 
persons of European descent but is substantially less common 
in most other populations The disorder usually manifests 
between the ages of 30 and 50 years, although it has been 
observed as early as 1 year of age and as late as 80 years 
of age. 

HD is characterized by a progressive loss of motor control, 
dementia, and psychiatric disorders. There is a substantial loss 
of neurons in the brain, which is detectable by techniques 
such as magnetic resonance imaging (MRI). Decreased glucose 
uptake in the brain, an early sign of the disorder, can be 
detected by positron-emission tomography (PET). (These tech¬ 
niques, however, are not specific enough to diagnose the dis¬ 
order accurately.) Although many parts of the brain are affected, 
the area most noticeably damaged is the corpus striatum. In 
some patients, the disease leads to a loss of 25% or more of 
total brain weight. 

The clinical course of HD is protracted, and the interval from 
initial diagnosis to death is typically 15 to 20 years. As in many 
neurological disorders, patients with HD experience difficulties 
in swallowing; aspiration pneumonia is the most common 
cause of death. Cardiorespiratory failure and subdural hema¬ 
toma (due to head trauma) are other frequent causes of death. 
The suicide rate among HD patients is 5 to 10 times higher 
than in the general population. Treatment includes drugs such 
as benzodiazepines to help control the choreic movements. 
Affective disturbances, which are seen in nearly half of the 
patients, are sometimes controlled with antipsychotic drugs 
and antidepressants. Although these drugs help to control 
some of the symptoms of HD, there is currently no way to alter 
the outcome of the disease. 

HD has the distinction of being the first genetic disease 
mapped to a specific chromosome using an RFLP marker, in 
1983. Subsequent sequencing of the disease-causing gene 
showed that the great majority of HD cases are caused by a 
CAG expanded repeat (see Chapter 3) located in exon 1. In 
90% to 95% of cases, the mutation is inherited from an 
affected parent. In the general population, the number of CAG 
tandem repeats ranges from 10 to 26. Persons with 27 to 35 
repeats are unaffected but are more likely to transmit a still 
larger number of repeats to their offspring. The inheritance of 
36 or more copies of the repeat can produce disease, although 
incomplete penetrance of the disease phenotype is seen in 
those who have 36 to 40 repeats. As in many disorders caused 
by trinucleotide repeat expansion, a larger number of repeats 
is correlated with earlier age of onset of the disorder. About 
60% to 70% of the variation in age of onset of HD can be 
predicted by repeat number. There is a tendency for greater 
repeat expansion when the father, rather than the mother, 
transmits the disease-causing mutation, which helps to explain 
the difference in ages of onset for maternally and paternally 
transmitted disease seen in Figure 4-13. In particular, 80% of 
cases with onset before 20 years of age (juvenile Huntington 
disease) are due to paternal transmission, and these cases are 
characterized by especially large repeat expansions. It remains 



FIG 4-15 Two cross sections of the brain of an adult with 
Huntington disease, which illustrate severe caudate atrophy 
and enlarged lateral ventricles. (Courtesy Dr. Thomas Bird, 
University of Washington.) 


unknown why the degree of repeat instability in the HD gene 
is greater in paternal transmission than in maternal transmis¬ 
sion (Fig. 4-15). 

The protein product, termed huntingtin, is involved in the 
transport of vesicles in cellular secretory pathways. In addition, 
there is evidence that huntingtin is necessary for the normal 
production of brain-derived neurotrophic factor. The CAG 
repeat expansion produces a lengthened series of glutamine 
residues near huntingtin's amino terminal. Although the precise 
role of the expanded glutamine tract in disease causation is 
unclear, it is correlated with a buildup of toxic protein aggre¬ 
gates within and near neuronal nuclei. These aggregates are 
thought to be toxic and are associated with early neuronal 
death. HD is notable in that affected homozygotes appear to 
display a clinical course very similar to that of heterozygotes 
(in contrast to most dominant disorders, in which homozygotes 
are more severely affected). This attribute, and the fact that 
mouse models in which one copy of the gene is inactivated 
are perfectly normal, supports the hypothesis that the mutation 
causes a harmful gain of function (see Chapter 3). 
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the disease-causing gene. Finally, variable expression can 
result from different types of mutations (i.e., different alleles) 
at the same disease locus. This is termed allelic heterogene¬ 
ity. Efforts are often made to establish genotype-phenotype 
correlations to better predict the severity of a genetic disease, 
given the patient’s genotype. In some cases, clinically distinct 
diseases may be the result of allelic heterogeneity, as in the 
[J-globin mutations that can cause either sickle cell disease 
or various forms of (J-thalassemia. 

Cystic fibrosis, discussed in Clinical Commentary 4-1, 
illustrates ways these factors can influence disease severity. 
CFTR mutations that result in a complete absence of chloride 
channels on cell surfaces tend to produce more-severe disease 
than do mutations that result in partially active chloride ion 
channels (allelic heterogeneity). Some of the variation in 
severity of lung disease among CF patients with identical 
CFTR genotypes can be explained by variation in the TGFBl 
gene (transforming growth factor (3), a modifier gene whose 
increased expression is also associated with asthma and 
chronic obstructive pulmonary disease. Mutations in another 
modifier gene, DCTN4, appear to reduce the effectiveness of 
the dynactin protein in clearing Pseudomonas aeruginosa 
from the airway. Patients with CF who suffer from more- 
frequent and severe bacterial infections experience acceler¬ 
ated lung damage. This disease thus provides examples of all 
three major causes of variable expression: allelic heterogene¬ 
ity, modifier genes, and nongenetic environmental factors 
(bacterial infections). 

Because of the many factors that can influence the expres¬ 
sion of a genetic disease, it should be apparent that the com¬ 
monly used term “single-gene disease” is an oversimplification. 
Although a mutation in a single gene may be sufficient to 
cause such a disease, its severity—always an important 


concern for clinicians—is typically influenced by many 
genetic and nongenetic factors. 

Variable expression of a genetic disease may 
be caused by environmental effects, modifier 
genes, or allelic heterogeneity. 

Locus Heterogeneity 

Quite commonly, a single disease phenotype can be caused 
by mutations at different loci in different families, which is 
termed locus heterogeneity (compare with allelic heteroge¬ 
neity, discussed in the previous section, in which different 
mutations are seen within the same disease locus). A good 
example is adult polycystic kidney disease (APKD), an auto¬ 
somal dominant disorder in which a progressive accumula¬ 
tion of renal cysts is seen. Patients can also develop liver cysts, 
hypertension, cerebral aneurysms, and cardiac valvular 
defects. Occurring in about 1 of every 1000 persons of Euro¬ 
pean descent, this disorder is responsible for 8% to 10% of 
end-stage renal disease in North America. APKD can be 
caused by mutations in genes on either chromosome 16 
(PKDl) or chromosome 4 (PKD2). Both of these genes 
encode membrane-spanning glycoproteins that interact with 
one another and may be involved in cellular signaling. (When 
this signaling goes awry, it is thought that cellular growth 
regulation is compromised, resulting in cyst formation.) In 
one family, the disease may be caused by a PKDl mutation, 
whereas in another family it may be caused by a PKD2 muta¬ 
tion. The disease states produced by mutations in these two 
genes may be clinically indistinguishable. 

Osteogenesis imperfecta provides a second example of 
locus heterogeneity. Recall from Chapter 2 that the subunits 
of the procollagen triple helix are encoded by two genes, one 



CLINICAL COMMENTARY 4-4 

Neurofibromatosis: A Disease with Highly Variable Expression 

Neurofibromatosis type 1 (NF1) is also known as von Reckling¬ 
hausen disease, recognizing the German physician who first 
described this condition in 1882. It is one of the more common 
autosomal dominant disorders, affecting approximately 1 in 
3000 individuals in all studied populations. NF1 provides a 
good example of variable expression in a genetic disease. 
Some patients have only cafe-au-lalt spots (French for "coffee 
with milk," describing the color of the hyperpigmented skin 
patches), Lisch nodules (benign growths on the iris), and a 
few neurofibromas (nonmalignant peripheral nerve tumors). 
These persons are sometimes unaware that they have the 
condition. Other patients have a much more severe expression 
of the disorder, including hundreds to thousands of neurofibro¬ 
mas, plexiform neurofibromas, optic pathway gliomas (benign 
tumors of the optic nerve), learning disabilities, hypertension, 
scoliosis (lateral curvature of the spine), and malignancies. 
About two thirds of patients have only a mild cutaneous 
involvement. Approximately 10% develop malignant peripheral 
nerve sheath tumors (MPNSTs), which typically arise from 
plexiform neurofibromas. Expression can vary significantly 


within the same family; a mildly affected parent can produce a 
severely affected offspring. 

A standard set of diagnostic criteria for NF1 has been devel¬ 
oped. Two or more of the following must be present: 

1. Six or more cafe-au-lait spots greater than 5 mm in diameter 
in prepubertal patients and greater than 15 mm in postpu- 
bertal patients 

2. Freckling in the armpits or groin area 

3. Two or more neurofibromas of any type or one plexiform 
neurofibroma (i.e., an extensive growth that occurs along a 
large nerve sheath) 

4. Two or more Lisch nodules 

5. Optic glioma 

6. Distinctive bone lesions, particularly an abnormally formed 
sphenoid bone or tibial pseudarthrosis* 

7. A first-degree relative with neurofibromatosis diagnosed 
using the previous six criteria 

Although NF1 has highly variable expression, the penetrance 
of disease-causing mutations is virtually 100%. Approximately 
50% of patients with NF1 have the condition because of a new 
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CLINICAL COMMENTARY 4-4 

Neurofibromatosis: A Disease with Highly Variable Expression—cont'd 

mutation. In fact, the NF1 gene has one of the highest known 
locus-specific mutation rates, about 1 in 10,000 per generation. 
This is due in part to the large size of the NF1 gene (approxi¬ 
mately 350 kb), making it a large target for mutation. The gene 
product, neurofibromin, acts as a tumor suppressor (see 
Chapter 11 for further details). NF1 mutations can be detected 
in more than 90% of cases, using a combination of detection 
methods including DNA sequencing, cytogenetic analysis, and 
analysis of abnormal (truncated) products. Persons in whom 
the entire NF1 gene is deleted tend to be severely affected, 
with large numbers of neurofibromas and an increased risk of 
developing MPNSTs. 

An NF1 mutation that occurs during embryonic development 
will affect only some cells of the individual, resulting in somatic 
mosaicism. In this case, the disease features may be confined 
to only one part of the body (segmental neurofibromatosis). 

Neurofibromatosis type 2 (NF2) is much rarer than NF1 and 
is characterized by vestibular schwannomas (tumors that arise 
in Schwann cells and affect the eighth cranial nerve) and, occa¬ 
sionally, cafe-au-lait spots. Patients who have NF2 do not, 
however, have true neurofibromas, so the term "neurofibro¬ 
matosis type 2" is a misnomer. The NF2 gene encodes a tumor 
suppressor protein called merlin or schwannomin. 

Mild cases of neurofibromatosis can require very little clinical 
management. Flowever, surgery may be required if malignan¬ 
cies develop or if benign tumors interfere with normal function. 
Scoliosis, tibial pseudarthrosis, and/or tibial bowing, seen in 
less than 5% of cases, can require orthopedic management. 
Flypertension can develop and is often secondary to a pheo- 
chromocytoma or a stenosis (narrowing) of the renal artery. 
The most common clinical problems in children are learning 
disabilities (seen in about 50% of persons with NF1), short 
stature, and optic gliomas (which can lead to vision loss). Close 
follow-up can help to detect these problems and minimize their 
effects. Recent clinical trials designed to reduce or eliminate 
the tumors seen in patients with NF1 have provided hope for 
better treatment options (Fig. 4-16). 



FIG 4-16 Neurofibromatosis type 1 (NF1). A, Multiple 
neurofibromas in an adult with type 1 neurofibromatosis. 
B, Lisch nodules (benign hamartomas of the iris) visible on 
a slit-lamp examination of an individual with type 1 neuro¬ 
fibromatosis. (A from Habif T, Campbell J, Chapman M, 
et al. Skin Disease: Diagnosis and Treatment. 2nd ed. 
St. Louis: Mosby: 2005; B from Jones KL. Smith's Recog¬ 
nizable Patterns of Human Malformation. 6th ed. Philadel¬ 
phia: Mosby: 2006.) 


* Pseudarthrosis can occur when a long bone, such as the tibia, undergoes a loss of bone cortex, leading to weakening and fracture. 
Abnormal callus formation causes a false joint in the bone, leading to the term (arthron = "joint"). 






74 


CHAPTER 4 Autosomal Dominant and Recessive Inheritance 


COL1A1 

COL1A2 

FIG 4-17 Structure of the triple helix type 1 collagen protein. 
The two tti chains are encoded by a gene on chromosome 17, 
and the a 2 chain is encoded by a gene on chromosome 7. 

on chromosome 17 and the other on chromosome 7 (Fig. 
4-17). A mutation occurring in either of these genes can alter 
the normal structure of the triple helix, resulting ultimately in 
osteogenesis imperfecta. Locus heterogeneity is seen in many 
other genetic diseases, including familial forms of Alzheimer 
disease, breast cancer, colon cancer, and melanoma. 

A disease that can be caused by mutations at 
different loci in different families is said to 
exhibit locus heterogeneity. 

Pleiotropy 

Genes that have more than one discernible effect on the body 
are said to be pleiotropic. A good example of a gene with 
pleiotropic effects is given by Marfan syndrome. First 
described in 1896 by Antoine Marfan, a French pediatrician, 
this autosomal dominant disorder affects the eye, the skele¬ 
ton, and the cardiovascular system (Clinical Commentary 
4-5). Most of the observed features of Marfan syndrome are 
caused by unusually stretchable connective tissue. The great 
majority of Marfan syndrome cases are caused by mutations 
in the gene that encodes fibrillin, a component of connective 
tissue that is expressed in most of the tissues and organs 
affected by Marfan syndrome (see Clinical Commentary 4-5). 
We have already discussed several other single-gene diseases 
in which pleiotropy is seen, including cystic fibrosis, in which 
sweat glands, lungs, and pancreas can be affected; osteogen¬ 
esis imperfecta, in which bones, teeth, and sclerae can be 
affected; and albinism, in which pigmentation and optic fiber 
development are affected. 

Genes that exert effects on multiple aspects of 
physiology or anatomy are pleiotropic. Pleio¬ 
tropy is a common feature of human genes. 

CONSANGUINITY IN HUMAN POPULATIONS 

Although consanguinity is relatively rare in Western popula¬ 
tions, it is common in many populations of the world. For 
example, first-cousin unions are seen in 20% to 50% of mar¬ 
riages in many countries of the Middle East, and uncle-niece 
and first-cousin marriages occur in some parts of India. 
Because relatives more often share disease genes inherited 
from a common ancestor, consanguineous unions are more 
likely to produce offspring affected by autosomal recessive 
disorders. It is possible to quantify the percentage of DNA 


sequence shared by a pair of relatives by estimating the coef¬ 
ficient of relationship (Box 4-1). Estimation of this quantity 
shows, for example, that siblings share 1/2 of their DNA 
sequences, on average, because they share two parents. Uncles 
and nieces share 1/4 of their DNA sequences because of 
common ancestry, first cousins share 1/8, first cousins once 
removed share 1/16, second cousins share 1/32, and so on.* 

Consanguinity and the Frequency 
of Recessive Diseases 

Recall that about 1 in 25 whites is a heterozygous carrier of a 
mutation that causes cystic fibrosis. A man who carries this 
allele thus has a 1 in 25 chance of meeting another carrier if 
he mates with somebody in the general population. Lie only 
triples his chance of meeting another carrier if he mates with 
a first cousin, who has a 1/8 chance of carrying the same gene. 
In contrast, a carrier of a relatively rare recessive disease, such 
as classic galactosemia (a metabolic disorder discussed in 
Chapter 7), has only a 1/170 chance of meeting another 
carrier in the general population. Because he shares 1/8 of his 
DNA with his first cousin, the chance that his first cousin also 
has a galactosemia mutation is still 1/8. With this rarer disease, 
a carrier is 21 times more likely to mate with another carrier 
in a first-cousin marriage than in a marriage with an unre¬ 
lated individual. This illustrates an important principle: the 
rarer the recessive disease, the more likely that the parents of 
an affected individual are consanguineous. 

This principle has been substantiated empirically. A Erench 
study showed that the frequency of first-cousin marriages in 
that country was less than 0.2%. Among patients with cystic 
fibrosis, a relatively common recessive disorder, 1.4% were 
the offspring of first-cousin matings. This percentage rose to 
7.1% for cystinosis and 12.5% for achromatopsia, both of 
which are less-common recessive disorders. 

Consanguinity increases the chance that a 
mating couple will both carry the same disease- 
causing mutation. It is seen more often in pedi¬ 
grees involving rare recessive diseases than in 
those involving common recessive diseases. 

Health Consequences of Consanguinity 

It has been estimated that each person carries the equivalent 
of one to five recessive mutations that would be lethal to 
offspring if matched with another copy of the mutation (i.e., 
homozygosity). It would therefore be expected that matings 
between relatives would more often produce offspring with 
genetic diseases. In fact, most empirical studies do show that 
mortality rates among the offspring of first-cousin marriages 
are substantially greater than those of the general population 
(Table 4-2). Similarly, the prevalence of genetic disease is 
roughly twice as high among the offspring of first-cousin 


*First cousins are the offspring of two siblings and thus share a set 
of grandparents. A first cousin once removed is the offspring of one’s 
first cousin. Second cousins are the offspring of two different first 
cousins and thus share a set of great-grandparents. 
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CLINICAL COMMENTARY 4-5 
Marfan Syndrome: An Example of Pleiotropy 

Marfan syndrome is an autosomal dominant condition seen in 
approximately 1 of every 10,000 North Americans. It is charac¬ 
terized by defects in three major systems: ocular, skeletal, and 
cardiovascular. The ocular defects include myopia, which is 
present in most patients with Marfan syndrome, and displaced 
lens (ectopia lentis), which is observed in about half of Marfan 
syndrome patients. The skeletal defects include dolichosteno- 
melia (unusually long and slender limbs), pectus excavatum 
("hollow chest"), pectus carinatum ("pigeon chest"), scoliosis, 
and arachnodactyly (literally "spider fingers," denoting the 
characteristically long, slender fingers) (Fig. 4-18). Marfan 
patients also typically exhibit joint hypermobility. 

The cardiovascular features of Marfan syndrome can be life- 
threatening. Most patients with Marfan syndrome develop 
prolapse of the mitral valve, a condition in which the cusps of 
the mitral valve protrude upward into the left atrium during 
systole, which can result in mitral regurgitation (leakage of 
blood back into the left atrium from the left ventricle). Mitral 
valve prolapse, however, is seen in 1 % to 3% of the general 
population and is often of little consequence. A more serious 
complication is dilatation (widening) of the ascending aorta, 
which is seen in 90% of Marfan patients. As dilatation increases, 
the aorta becomes susceptible to dissection or rupture, particu¬ 
larly when cardiac output is high (as in heavy exercise or preg¬ 
nancy). As the aorta widens, the left ventricle enlarges, and 
cardiomyopathy (damage to the heart muscle) ensues. The end 
result is congestive heart failure, a common cause of death 
among Marfan syndrome patients. 

Most cases of Marfan syndrome are caused by mutations in 
a gene, FBN1, that is expressed in the aorta, the periosteum, 
and the suspensory ligament of the lens. Because FBNl 
encodes a connective tissue protein, fibrillin, mutations of this 
gene alter the structure of connective tissue. This helps to 
explain some of the cardiovascular and ocular features of this 
disorder. Hundreds of different FBNl mutations have been 
identified in Marfan syndrome patients. Most of these are mis- 
sense mutations, but frameshifts and nonsense mutations 
producing a truncated fibrillin protein are also seen. In many 
cases, the missense mutations produce a more severe disease 
phenotype because of a dominant negative effect (i.e., the 
abnormal fibrillin proteins bind to and disable many of the 
normal fibrillin proteins produced by the normal allele in a het¬ 
erozygote). A severe neonatal form of the disease is produced 
by mutations in exons 24 to 32. At least one Marfan syndrome 
compound heterozygote has been reported. This infant, who 
inherited a disease-causing allele from each of its affected 
heterozygous parents, had severe congestive heart failure and 
died from cardiac arrest at 4 months of age. 

Specific mutations in FBNl can cause familial arachnodactyly 
(with no other symptoms of Marfan syndrome), whereas 
others can cause familial ectopia lentis. A disease called con¬ 
genital contractural arachnodactyly exhibits many of the skel¬ 
etal features of Marfan syndrome but does not involve cardiac 
or ocular defects. This disease is caused by mutations in a 
second gene, FBN2, that encodes another form of fibrillin. 

A small percentage of persons with Marfan syndrome do not 
have mutations in FBNl or FBN2 but instead have mutations 
in the gene that encodes transforming growth factor (3 receptor 



FIG 4-18 A, A young man with Marfan syndrome. Note 
characteristically long limbs and narrow face. B, Arachno¬ 
dactyly in an 8-year-old girl with Marfan syndrome. Note 
projection of the thumb well beyond the edge of the 
palm (Steinberg thumb sign). (A from Jones KL. Smith's 
Recognizable Patterns of Human Malformation. 6th ed. 
Philadelphia: Saunders; 2006:549. B from Jones KL. Smith's 
Recognizable Patterns of Human Malformation. 6th ed. 
Philadelphia: Mosby; 2006.) 


2 (TGFBR2). These mutations increase the signaling activity of 
transforming growth factor ft (TGF-p), which contributes to 
aortic dilatation and abnormal bone growth. It is interesting that 
the fibrillin protein contains a domain that binds TGF-p, and 
mutations that disrupt fibrillin increase TGF-p signaling. Thus, 
FBNl mutations can produce structural connective tissue 
abnormalities as well as abnormal TGF-p activity, accounting 
for the pleiotropic features of this disorder. 


Continued 
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CLINICAL COMMENTARY 4-5 

Marfan Syndrome: An Example of Pleiotropy—cont'd 

Treatment for Marfan syndrome includes regular ophthalmo- 
loglcal examinations and, for individuals with aortic dilatation, 
the avoidance of heavy exercise and contact sports. In addition, 
P-adrenergic blockers (e.g., atenolol) can be administered to 
decrease the strength and abruptness of heart contractions. 
This reduces stress on the aorta, although It Is unclear whether 
these drugs reduce aortic dilatation. In some cases, the aorta 
and aortic valve are surgically replaced with a synthetic tube 
and artificial valve. With such treatment, persons with Marfan 
syndrome can achieve nearly normal life spans. 

Another possible avenue for treatment has been opened by 
the discovery of elevated TGF-p signaling in Marfan syndrome. 
In mouse models of this disorder, administration of TGF-p 
antagonists has been shown to prevent aortic dilatation. One 


of these agents, losartan, is an angiotensin II type 1 receptor 
antagonist and is commonly used to treat high blood pressure. 
This drug has been evaluated In clinical trials and has been 
shown to cause a significant reduction in aortic dilatation in 
patients with Marfan syndrome. 

A number of historical figures might have had Marfan syn¬ 
drome, Including Niccolo Paganini, the violinist, and Sergei 
Rachmaninoff, the composer and pianist, and U.S. President 
Abraham Lincoln. Some of Lincoln's skeletal features are seen 
In Marfan syndrome, and examination of his medical records 
has shown that he had features consistent with aortic dilatation. 
It has been suggested that he was In congestive heart failure 
at the time of his death and that, had he not been assassinated, 
he still would not have survived his second term of office. 


BOX 4-1 Measurement of Consanguinity: The Coefficient of Relationship 


To determine the possible consequences of a consanguineous 
mating, it is useful to know what percentage of DNA Is shared 
by two related Individuals because of their descent from a 
common ancestor. The coefficient of relationship is a measure 
of this percentage. Clearly, Individuals who are more closely 
related must share a greater percentage of their genes. To 
begin with a simple example, an individual receives half of his 
or her genes from each parent. Thus, the coefficient of relation¬ 
ship between a parent and offspring is 1/2. This also means 
that the probability that the parent and offspring share a given 
DNA sequence (e.g., a disease allele) Is 1/2. 

To continue with a more complex example, suppose that a 
man is known to be a heterozygous carrier for galactosemia, a 
relatively rare autosomal recessive metabolic disorder. If he 
mates with his first cousin, what is the probability that she also 
carries this disease-causing allele? We know that this probabil¬ 
ity must be higher than that of the general population, because 
first cousins share one set of grandparents. There is thus a 
possibility that the grandparent who transmitted the galactose¬ 
mia allele to the known carrier also transmitted it to the carri¬ 
er's cousin. The coefficient of relationship specifies this 
probability. A pedigree for a first-cousin mating is shown below 
In panel A on the left. The male carrier is labeled A, and his 
female cousin is labeled E. Because we are Interested only in 
the family members who are related to both the man and his 
cousin, the pedigree is condensed, in panel B on the right, to 


include only those individuals who form a path between the 
man and his cousin. 

To estimate the coefficient of relationship, we begin with the 
carrier and ascend the pedigree. We know that there is a prob¬ 
ability of 1/2 that the known carrier inherited the gene from the 
parent in the path (labeled B). There is also a probability of 1/2 
that he inherited the gene from his other parent, who is not 
related to his cousin and is thus not included in the diagram. 
By similar reasoning, the probability that individual B inherited 
the disease gene from his parent, individual C, Is also 1/2. The 
probability that individual C in turn passed on the disease gene 
to his offspring, D, is 1/2, and the probability that D passed the 
disease gene to E is also 1/2. Thus, for E to share a disease- 
causing allele with A, each of these four events must have 
taken place. The multiplication rule dictates that, to find the 
probability that all four events have taken place, we take the 
product of all four probabilities. Because each of these proba¬ 
bilities is 1/2, the result Is (1/2)'*= 1/16. 

If Individuals A and E shared only one grandparent, the coef¬ 
ficient of relationship would be 1/16. But, as with most first 
cousins, they share a common grandfather and grandmother. 
Thus, there are two paths through which the disease gene could 
have passed. To obtain the probability that the gene passed 
through the second path, we use the same procedure as in the 
previous paragraph and obtain a probability of 1/16. Now we 
need to estimate the probability that the gene went through 


A, Pedigree for a first-cousin mating. 

B, The pedigree is condensed to 
show only those individuals who are 
related to both of the first cousins. 
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BOX 4-1 Measurement of Consanguinity: The Coefficient of Relationship—cont'd 


either the first path or the second (i.e., through one grandparent 
or the other). The addition rule states that we can add these 
two probabilities together to get the overall probability that A 
and E share a disease gene: 1/16 + 1/16 = 1/8. The probability 
that the carrier's cousin shares his disease allele, as a result of 
their descent from a common set of grandparents, Is thus 1/8. 
This is the coefficient of relationship for first cousins.* * 

It should be recognized that individual E could also Inherit a 
disease allele from an ancestor not Included In either of these 
paths. However, for disease alleles that are relatively rare in 
populations, this probability is small and can usually be 
disregarded. 


The rules for calculating the coefficient of relationship can be 

summarized as follows: 

1. Each individual can appear In a route only once. 

2. Always begin with one individual, proceed up the pedigree 
to the common ancestor, then down the pedigree to the 
other individual. 

3. The coefficient of relationship for one route is given by 
(1/2)"“\ where n is the number of individuals in the route. 

4. If there are multiple routes (I.e., multiple common ances¬ 
tors), the probabilities estimated for each route are added 
together. 


*A related quantity, often used in population genetics, is the inbreeding coefficient. This coefficient is the probability that an individual is 
homozygous at a locus as a result of consanguinity in his or her parents. For a given type of mating, the inbreeding coefficient of an 
individual always equals the parents' coefficient of relationship multiplied by 1/2 (e.g., the inbreeding coefficient for the offspring of a 
first-cousin mating is 1/16). 


TABLE 4-2 Mortality Levels among Cousin and Unrelated Control Marriages in Selected 
Human Populations 


POPULATION 

MORTALITY 

TYPE 

1.0 COUSIN 

1.5 COUSIN* 

2.0 COUSIN 

UNRELATED 

% 

N 

% 

N 

% 

N 

% 

N 

Amish (Old Order) 

Prereproductive 

14.4 

1218' 

— 

— 

13.3 

6064 

8.2 

17,200 

Bombay, India 

Perinatal 

4.8 

3309 

2.8 

176 

0 

30 

2.8 

35,620 

France (Loir-et-Cher) 

Prereproductive 

17.7 

282 

6.7 

105 

11.7 

240 

8.6 

1117 

Fukuoka, Japan 

0 to 6 yr 

10.0 

3442 

8.3 

1048 

9.2 

1066 

6.4 

5224 

Hirado, Japan 

Prereproductive 

18.9 

2301 

15.3 

764 

14.7 

1209 

14.3 

28,569 

Kerala, India 

Prereproductive 

18.6 

391 

— 

— 

11.8 

34 

8.7 

770 

Punjab, Pakistan 

Prereproductive 

22.1 

3532 

22.9 

1114 

20.1 

57 

16.4 

4731 

Sweden 

Prereproductive 

14.1 

185 

13.7 

227 

11.4 

79 

8.6 

625 

Utah Mormons 

Prereproductive 

22.4 

1048 

15.3 

517 

12.2 

1129 

13.2 

302,454 


Modified from Jorde LB. Inbreeding in human populations. In: Dulbecco R, ed. Encyclopedia of Human Biology. Vol. 5. New York: Academic 
Press: 1997:1-13. 

*First cousins once removed. 

"^Includes 1.5 cousins. 


marriages as among the offspring of unrelated persons. First- 
cousin marriages are iUegal in most states of the United 
States, and marriages between closer relatives (except double 
first cousins, who share both sets of grandparents) are pro¬ 
hibited throughout the United States. 

Very few data exist for matings between siblings or parents 
and offspring (defined as incest). The limited data indicate 
that between % and )4 of incestuous matings result in off¬ 
spring affected by serious disease, with mental impairment as 
one of the most common consequences. Because of small 


sample sizes in these studies, it is difficult to separate the 
effects of genetics from those of a substandard environment. 
It is likely that the problems experienced by the offspring of 
incestuous matings are caused by both genetic and environ¬ 
mental influences. 

Atthe population level, consanguinity increases 
the frequency of genetic disease and mortality. 
The closer the degree of consanguinity, the 
greater the increase. 



I STUDY QUESTIONS 


1. A man who has achondroplasia marries a phenotypically 
normal woman. If they have four children, what is the 
probability that none of their children will be affected with 
this disorder? What is the probability that all of them will 
be affected? 

2. The estimated penetrance for familial retinoblastoma is 
approximately 90%. If a man has had familial retinoblas¬ 
toma and mates with a woman who does not have a reti¬ 


noblastoma mutation, what is the risk that their offspring 
will develop retinoblastoma? 

3. A 30-year-old woman had a sister who died from infantile 
Tay-Sachs disease, an autosomal recessive disorder that is 
fatal by age 6 years. What is the probability that this woman 
is a heterozygous carrier of the Tay-Sachs mutation? 

4. A man has neurofibromatosis type 1. His mother also has 
this condition. What is the probability that his sister also 
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has this disease? In the absence of knowledge of his sister’s 
phenotype, what is the probability that his sister’s daugh¬ 
ter has neurofibromatosis type 1? 

5. Consider a woman who is a known heterozygous carrier 
of a mutation that causes PKU (autosomal recessive). 
What is the probability that her two grandchildren, who 
are first cousins, are both heterozygous carriers of this 
PKU-causing allele? Suppose instead that the woman is 
affected with PKU. Now what is the probability that both 
of her grandchildren carry the disease-causing allele? 

6. Two mating individuals, labeled A and B in Figure 4-19, 
share a single great-grandparent. What is their coefficient 
of relationship? Suppose that one member of this couple 
is a heterozygous carrier for PKU. What is the probability 
that this couple will produce a child affected with PKU? 

7. A suspect in a rape case has been typed for three STR 
(short tandem repeat) loci. His alleles match those of the 
evidentiary sample (semen taken from the rape victim) for 
each locus. He is a heterozygote for the first two loci and 
a homozygote for the third. The allele frequencies for locus 
1 in the general population are 0.05 and 0.10. For locus 2, 
they are 0.07 and 0.02. For locus 3, the allele frequency in 
the general population is 0.08. What is the probability that 



FIG 4-19 Diagram for study question 6. 

a random individual in the general population would 
match the evidentiary sample? 

8. A man implicated in a paternity suit has had his DNA 
tested to establish whether or not he is the father of the 
baby. Four STR loci were tested for him, the mother, and 
the baby. The baby’s alleles and the man’s alleles match for 
aU four loci. The frequencies of these alleles in the general 
population are 0.05,0.01,0.01, and 0.02. What is the prob¬ 
ability that someone else in the general population could 
be the father of the baby? 
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CHAPTER 


Sex-Linked and Nontraditional 
Modes of Inheritance 


The previous chapter dealt with diseases that are inherited in 
patterns that were first elucidated by Gregor Mendel. In this 
chapter we discuss disease-causing mutations that are inher¬ 
ited in ways that were unknown to Mendel and are thus 
sometimes termed nonmendelian. 

The first mutations to be discussed are DNA variants of 
the sex chromosomes (X and Y), known as sex-linked muta¬ 
tions. The human X chromosome is large, containing about 
5% of the nuclear genome’s DNA (approximately 155 million 
base pairs [155 megabases, 155 Mb]). More than 1200 genes 
have been localized to the X chromosome, and the diseases 
caused by these genes are said to be X-linked. In contrast to 
the X chromosome, the Y chromosome is quite small (60 Mb) 
and contains only a few dozen genes. 

Another group of disease-causing mutations is located in 
the mitochondrial genome, which is inherited only from 
one’s mother. Mitochondrial diseases thus display a unique 
pattern of inheritance in families. Extensive analyses have 
revealed a growing number of disease-causing mutations in 
the mitochondrial genome. 

Finally, we discuss two processes that have been eluci¬ 
dated only in the past 2 to 3 decades: anticipation and 
imprinting. Anticipation refers to earlier age-of-onset of 
some genetic diseases in more recent generations of families. 
Imprinting refers to the fact that some genes are expressed 
only on paternally transmitted chromosomes and others are 
expressed only on maternally transmitted chromosomes. 
Our understanding of both of these processes has been 
greatly enhanced by detailed molecular analyses of humans 
and model organisms. 

X INACTIVATION 

The X chromosome contains many important protein-coding 
genes, and it has long been known that human females have 
two X chromosomes and males have only one. Thus, females 
have two copies of each X-linked gene, and males have only 
one copy. Yet males and females do not differ in terms of the 
amounts of protein products (e.g., enzyme levels) encoded by 
most of these genes. What could account for this? 

In the early 1960s Mary Lyon hypothesized that one X 
chromosome in each somatic cell of the female is inactivated. 
This would result in dosage compensation, an equalization 
of the amount of X-linked gene products in males and 
females. The Lyon hypothesis stated that X inactivation 


occurs early in female embryonic development and that the 
X chromosome contributed by the father is inactivated in 
some cells, whereas in other cells the X chromosome contrib¬ 
uted by the mother is inactivated. In each cell, one of the two 
X chromosomes is chosen at random for inactivation, so the 
maternally and paternally transmitted X chromosomes are 
each inactivated in about half of the embryo’s cells. Thus, 
inactivation, like gamete transmission, is analogous to a coin¬ 
tossing experiment. Once an X chromosome is inactivated in 
a cell, it will remain inactive in all descendants of that cell. X 
inactivation is therefore a randomly determined, hut fixed (or 
permanent), process. As a result of X inactivation, all normal 
females have two distinct populations of cells: one population 
has an active paternally derived X chromosome, and the other 
has an active maternally derived X chromosome. (Fig. 5-1 
provides a summary of this process.) Because they have two 
populations of cells, females are mosaics (see Chapter 4) 
for X chromosome activity. Males, having only one copy of 
the X chromosome, are not mosaics but are hemizygous 
for the X chromosome (hemi means “half”). 

The Lyon hypothesis states that one X chro¬ 
mosome in each cell is randomly inactivated 
early in the embryonic development of females. 
This ensures that females, who have two 
copies of the X chromosome, will produce 
X-linked gene products in quantities roughly 
similar to those produced in males (dosage 
compensation). 

The Lyon hypothesis relied on several pieces of evidence, 
most of which were derived from animal studies. First, it was 
known that females are typically mosaics for some X-linked 
traits and males are not. For example, female “calico” cats 
have alternating black and orange patches of fur that corre¬ 
spond to two populations of cells: one that contains X chro¬ 
mosomes in which an “orange” allele is active and one that 
contains X chromosomes in which a “black” allele is active. 
Male cats of this breed do not exhibit alternating colors and 
have either black or orange patches of fur. Another example, 
seen in humans, is X-linked ocular albinism. This is an 
X-linked recessive condition characterized by a lack of 
melanin production in the retina and by ocular problems 
such as nystagmus (rapid involuntary eye movements) and 
decreased visual acuity. Males who inherit the mutation show 
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a relatively uniform lack of melanin in their retinas, whereas 
female heterozygotes exhibit alternating patches of pigmented 
and nonpigmented tissue (Fig. 5-2). 

The Lyon hypothesis was also supported by biochemical 
evidence. The enzyme glucose-6-phosphate dehydrogenase 
(G6PD) is encoded by a gene on the X chromosome and is 
present in equal quantities in males and females (dosage 



FIG 5-1 The X inactivation process. The maternal (m) and 
paternal (p) X chromosomes are both active in the zygote and 
in early embryonic cells. X inactivation then takes place, 
resulting in cells having either an active paternal X or an active 
maternal X chromosome. Females are thus X chromosome 
mosaics, as shown in the tissue sample at the bottom of the 
figure. 


compensation). In females who are heterozygous for two 
common G6PD alleles (labeled A and B), some skin cells 
produce only the A variant of the enzyme and others produce 
only the B variant. This is further proof of X chromosome 
mosaicism in females. 

Finally, cytogenetic studies in the 1940s showed that inter¬ 
phase cells of female cats often contained a densely staining 
chromatin mass in their nuclei. These masses were not seen 
in males. They were termed Barr bodies, after Murray Barr, 
one of the scientists who described them. Barr and his col¬ 
league Ewart Bertram hypothesized that the Barr body rep¬ 
resented a highly condensed X chromosome. It is now known 
that Barr and Bertram were correct and that the inactive X 
chromosome is observable as a Barr body in the somatic cells 
of normal females. Its condensed state is correlated with 
reduced transcriptional activity, and its DNA is replicated 
later in the S phase than that of other chromosomes. 

The Lyon hypothesis is supported by cytoge¬ 
netic evidence: Barr bodies, which are inactive 
X chromosomes, are seen only in cells with 
two or more X chromosomes. It is also sup¬ 
ported by biochemical and animal studies that 
reveal mosaicism of X-linked traits in female 
heterozygotes. 

Further study has largely verified the Lyon hypothesis. 
Messenger RNA (mRNA) is transcribed from only one X 
chromosome in each somatic cell of a normal female. The 
inactivation process takes place within approximately 7 to 10 
days after fertilization, when the embryonic inner-cell mass 
contains no more than a few dozen cells. Inactivation is initi¬ 
ated in a single 1-Mb region on the X chromosome long arm. 



A I B 


FIG 5-2 Fundus photos of X-linked ocular albinism. A, Fundus photograph of a female heterozy¬ 
gous carrier for X-linked ocular albinism. The pigmented and nonpigmented patches demonstrate 
mosaicism of the X chromosome as a result of random X inactivation. B, Fundus photograph 
of the heterozygous carrier's son, showing a much greater lack of melanin pigment. (Courtesy 
Dr. Donnell J. Creel, University of Utah Flealth Sciences Center.) 
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the X inactivation center, and then spreads along the chro¬ 
mosome. Although inactivation is random among cells that 
make up the embryo itself, only the paternally derived X 
chromosome is inactivated in cells that will become extraem- 
bryonic tissue (e.g., the placenta). X inactivation is perma¬ 
nent for all somatic cells in the female, but the inactive X 
chromosome must later become reactivated in the female’s 
germline so that each of her egg cells will receive one active 
copy of the X chromosome. 

An important implication of the Lyon hypothesis is that 
the number of Barr bodies in somatic cells is always one less 
than the number of X chromosomes. Normal females have 
one Barr body in each somatic cell, and normal males have 
none. Females with Turner syndrome (see Chapter 6), having 
only one X chromosome, have no Barr bodies. Males with 
Klinefelter syndrome (two X chromosomes and a Y chromo¬ 
some) have one Barr body in their somatic cells, and females 
who have three X chromosomes per cell have two Barr bodies 
in each somatic cell. This pattern leads to another question: 
if the extra X chromosomes are inactivated, why aren’t people 
with extra (or missing) X chromosomes phenotypically 
unaffected? 

The answer to this question is that X inactivation is incom¬ 
plete. Some regions of the X chromosome remain active in all 
copies. For example, the tips of the short and long arms of 
the X chromosome do not undergo inactivation. The tip of 
the short arm of the X chromosome is homologous to the 
distal short arm of the Y chromosome (see Chapter 6). In 
total, about 15% of the genes on the human X chromosome 
escape inactivation, and relatively more genes on the short 
arm escape inactivation than on the long arm. Some of the 
X-linked genes that remain active on both copies of the X 
chromosome have homologs on the Y chromosome, preserv¬ 
ing equal gene dosage in males and females. Thus, having 
extra (or missing) copies of active portions of the X chromo¬ 
some contributes to a disease phenotype. 

X inactivation is random, fixed, and incom¬ 
plete. The last feature helps to explain why, 
despite X inactivation, most persons with 
abnormal numbers of sex chromosomes have 
a disease phenotype. 

The X inactivation center contains a gene, XIST, that is 
transcribed only on the inactive X chromosome; its 17-kb 
mRNA transcripts are detected in normal females but not in 
normal males. The RNA transcript, however, is not translated 
into a protein and is an example of a long noncoding RNA 
(IncRNA; compare with miRNA in Chapter 2). The XIST 
RNA transcript remains in the nucleus and coats the inactive 
X chromosome. This process acts as a signal that leads to 
other aspects of inactivation, including late replication and 
condensation of the inactive X chromosome. 

Methylation and histone deacetylation are additional fea¬ 
tures of the inactive X chromosome. Many CG dinucleotides 
in the 5' regions of genes on the inactive X are heavily methyl¬ 
ated, and the administration of demethylating agents, such as 


5-azacytidine, can partially reactivate an inactive X chromo¬ 
some in vitro. However, methylation does not appear to be 
involved in spreading the inactivation signal from the inacti¬ 
vation center to the remainder of the X chromosome. It is 
more likely to be responsible for maintaining the inactivation 
of a specific X chromosome in a cell and all of its 
descendants. 

The XIST gene is located in the X inactivation 
center and is required for X inactivation. It 
encodes a IncRNA product that coats the 
inactive X chromosome. X inactivation is 
also associated with methylation of the 
inactive X chromosome, a process that might 
help to ensure the long-term stability of 
inactivation. 

SEX-LINKED INHERITANCE 

Sex-linked genes are those that are located on either the X or 
the Y chromosome. Because only a few dozen genes are 
known to be located on the human Y chromosome, our atten¬ 
tion will be focused mostly on X-linked diseases. These have 
traditionally been grouped into X-linked recessive and 
X-linked dominant categories, and these categories are used 
here for consistency with other literature. However, because 
of variable expression, incomplete penetrance, and the effects 
of random X inactivation, the distinction between X-linked 
dominant and X-linked recessive inheritance is sometimes 
ambiguous. 

X-Linked Recessive Inheritance 

A number of well-known diseases and traits are caused by 
X-linked recessive genes. These include hemophilia A (Clini¬ 
cal Commentary 5-1), Duchenne muscular dystrophy (Clini¬ 
cal Commentary 5-2), and red-green colorblindness (Box 
5-1). Additional X-linked diseases are listed in Table 5-1. The 
inheritance patterns and recurrence risks for X-linked reces¬ 
sive diseases differ substantially from those for diseases caused 
by autosomal genes. 

Because females inherit two copies of the X chromosome, 
they can be homozygous for a disease-causing allele at a given 
locus, heterozygous at the locus, or homozygous for the 
normal allele at the locus. In this way, X-linked loci in females 
are much like autosomal loci. However, for most X-linked 
loci, there is only one copy of the allele in an individual 
somatic cell because of X inactivation. This means that about 
half of the cells in a heterozygous female will express the 
disease allele and half will express the normal allele. 

Thus, as with autosomal recessive traits, the heterozygote 
will produce about 50% of the normal level of the gene 
product. Ordinarily this is sufficient for a normal phenotype. 
The situation is different for males, who are hemizygous for 
the X chromosome. If a male inherits a recessive disease gene 
on the X chromosome, he will be affected by the disease 
because the Y chromosome does not carry a normal allele to 
compensate for the effects of the disease allele. 
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CLINICAL COMMENTARY 5-1 
Hemophilia A 

Hemophilia A is caused by mutations in the gene that encodes 
clotting factor VIII and affects approximately 1 in 5000 to 1 in 
10,000 males worldwide. It is the most common of the severe 
bleeding disorders and has been recognized as a familial disor¬ 
der for centuries. The Talmud states that boys whose brothers 
or cousins bled to death during circumcision are exempt from 
the procedure (this may well be the first recorded example of 
genetic counseling). 

Hemophilia A is caused by deficient or defective factor VIII, 
a key component of the clotting cascade. Fibrin formation is 
affected, resulting in prolonged and often severe bleeding from 
wounds and hemorrhages in the joints and muscles (Fig. 5-3). 
Bruising is often seen. Hemarthroses (bleeding into the joints) 
are common in the ankles, knees, hips, and elbows. These 
events are often painful, and repeated episodes can lead to 
destruction of the synovium and diminished joint function. 
Intracranial hemorrhages can occur and are a leading cause of 
death. Platelet activity Is normal in hemophiliacs, so minor 
lacerations and abrasions do not usually lead to excessive 
bleeding. 

Hemophilia A varies considerably in its severity, and this varia¬ 
tion is correlated directly with the level of factor VIII. About half 
of hemophilia A patients fall into the severe category, with 
factor VIII levels that are less than 1 % of normal. These persons 
experience relatively frequent bleeding episodes, often several 
per month. Patients with moderate hemophilia (1-5% of normal 
factor VIII) generally have bleeding episodes only after mild 
trauma and typically experience one to several episodes per 
year. Persons with mild hemophilia have 5% to 30% of the 
normal factor VIII level and usually experience bleeding epi¬ 
sodes only after surgery or relatively severe trauma. 

Historically, hemophilia A was often fatal before 20 years of 
age, but a major advance in treatment came in the early 1960s 


with the ability to purify factor VIII from donor plasma. Factor 
VIII Is usually administered at the first sign of a bleeding 
episode and is a highly effective treatment. Prophylactic factor 
VIII administration In severe hemophiliacs is effective in pre¬ 
venting loss of joint function. By the 1970s, the median age at 
death of persons with hemophilia had increased to 68 years. 

The major drawback of donor-derived factor VIII was the fact 
that, because a typical Infusion contained plasma products 
from hundreds or thousands of different donors, it was often 
contaminated by viruses. Consequently, patients often suf¬ 
fered from hepatitis B and C infections. More seriously, human 
Immunodeficiency virus (HIV) can be transmitted In this 
manner, and it is estimated that half of American hemophilia 
patients treated with donor-derived factor VIII between 1978 
and 1985 became Infected with HIV. From 1979 to 1998, 
acquired Immune deficiency syndrome (AIDS) accounted for 
nearly half of deaths among Americans with hemophilia A, 
which resulted in a decrease in the median age at death to 49 
years in the 1980s. Donor blood has been screened for HIV 
since 1985, and heat treatment of donor-derived factor VIII kills 
HIV and hepatitis B virus, nearly eliminating the threat of Infec¬ 
tion. Consequently, AIDS mortality among those with hemo¬ 
philia A has decreased markedly since 1995. 

Cloning and sequencing of the factor VIII gene has led to a 
number of insights. Patients with nonsense or frameshift muta¬ 
tions usually develop severe hemophilia, and those with mls- 
sense mutations usually have mild to moderate disease. This is 
expected because nonsense and frameshift mutations typically 
produce a truncated protein that is degraded and lost. Missense 
mutations produce a single amino acid substitution without a 
dominant negative effect, usually resulting in an altered but 
partially functional protein product. Many of the point mutations 
take place at methylated CG sequences, which are hot spots for 



FIG 5-3 A, The enlarged right knee joint of a patient with hemophilia A, demonstrating the effects 
of hemarthrosis. B, Extensive bruising of the right outer thigh. (From Hoffbrand VA. Color Atlas 
of Clinical Hematology. 3rd ed. Philadelphia: Mosby; 2000:281-283.) 
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CLINICAL COMMENTARY 5-1 
Hemophilia A—cont'd 
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FIG 5-4 A pedigree showing the inheritance of hemophilia B in the European royal families. The 
first known carrier in the family was Queen Victoria. Note that all of the affected individuals are 
male. (Modified from McCance K, Huether S. Pathophysiology: The Biologic Basis for Disease 
in Adults and Children. 5th ed. St. Louis; Mosby; 2005.) 


mutation (see Chapter 3). About 45% of severe cases of hemo¬ 
philia A are caused by a chromosome inversion (see Chapter 6) 
that disrupts the factor VIII gene. An additional 5% of patients 
have deletions, which usually lead to relatively severe disease. 
About 10% of female heterozygotes have factor VIII levels less 
than 35%, and some of these are manifesting heterozygotes, 
with mild symptoms of hemophilia A. 

Cloning of the factor VIII gene has enabled the production of 
human factor VIII using recombinant DNA techniques. Exten¬ 
sive clinical testing showed that recombinant factor VIII works 
as effectively as the donor-derived form, and it was approved 
for commercial use in 1994. Recombinant factor VIII has, of 
course, the advantage that there is no possibility of viral con¬ 
tamination. However, as with other forms of factor VIII, recom¬ 
binant factor VIII generates antifactor VIII antibody production 
in approximately 10% to 15% of patients. (This response is 
most common in patients who have no native factor VIII 
production.) 

Two other major bleeding disorders are hemophilia B and von 
Willebrand disease. Hemophilia B, sometimes called Christmas 
disease,* is also an X-linked recessive disorder and is caused 
by a deficiency of clotting factor IX. This condition is about one 
fifth as common as hemophilia A and can be treated with 
donor-derived or recombinant factor IX. Von Willebrand disease 
is an autosomal dominant disorder that is highly variable in 
expression. Although it can affect as many as 1 % of individuals 
of European descent, it reaches severe expression in fewer 
than 1 in 10,000. The von Willebrand factor, which is encoded 


by a gene on chromosome 12, acts as a carrier protein for 
factor VIII. In addition, it binds to platelets and to damaged 
blood vessel endothelium, thus promoting the adhesion of 
platelets to damaged vessel walls. 

Hemophilia is of historical interest because it affected 
members of the royal families of Germany, Spain, England, and 
Russia (Fig. 5-4). Among these families. Queen Victoria of 
England was the first known heterozygous hemophilia carrier. 
She had one affected son, and two of her daughters had 
affected sons, making them presumptive carriers. One of her 
affected great-grandsons was the Tsarevitch Alexei of Russia, 
the son of Tsar Nicholas II and Alexandra. Grigori Rasputin, 
called the "mad monk," had an unusual ability to calm the 
Tsarevitch during bleeding episodes, probably through hypno¬ 
sis. As a result, he came to have considerable influence in the 
royal court, and some historians believe that his destabilizing 
effect helped to bring about the 1917 Bolshevik revolution. 
Recently, the Russian royal family was again touched by genet¬ 
ics. Using the polymerase chain reaction, autosomal DNA mic¬ 
rosatellites and mitochondrial DNA sequences were assayed 
in the remains of several bodies exhumed near Yekaterinburg, 
the reputed murder site of the royal family. Analysis of this 
genetic variation and comparison with living maternal relatives 
showed that the bodies were indeed those of the Russian royal 
family. Further analysis demonstrated that Alexei had a patho¬ 
genic mutation in the gene that encodes factor IX, establishing 
that the royal families were affected by hemophilia B (rather 
than the more common hemophilia A). 


Christmas was the last name of the first reported patient. 































































84 


CHAPTER 5 Sex-Linked and Nontraditional Modes of Inheritance 


CLINICAL COMMENTARY 5-2 
Duchenne Muscular Dystrophy 

Muscular dystrophy, defined as a progressive weakness and 
loss of muscle, exists in dozens of different forms. Of these, 
Duchenne muscular dystrophy (DMD), named after the French 
neurologist who provided the first comprehensive description 
in 1868, is one of the most severe and common forms. It 
affects approximately 1 of every 4,000 males, a prevalence 
figure that is similar among all ethnic groups studied thus far. 

The symptoms of DMD are usually seen before the age of 5 
years, when parents begin to notice unusual clumsiness and 
muscle weakness. Pseudohypertrophy of the calves, the result 
of infiltration of muscle by fat and connective tissue, is often 
seen early in the course of the disease. All skeletal muscle 
degenerates eventually, and most patients with DMD are con¬ 
fined to a wheelchair by 11 years of age. The heart and respira¬ 
tory musculature become impaired, and death usually results 
from respiratory or cardiac failure. Survival beyond age 25 years 
is uncommon: little can be done to alter the ultimate course of 
this disease. 

As muscle cells die, the enzyme creatine kinase (CK) leaks 
into the blood stream. In DMD patients, serum CK is elevated 
at least 20 times above the upper limit of the normal range. 
This elevation can be observed presymptomatically, before 
clinical symptoms such as muscle wasting are seen. Other 
traditional diagnostic tools include electromyography, which 
reveals reduced action potentials, and muscle biopsy. 

Female heterozygous carriers of the DMD-causing muta¬ 
tions are usually free of disease, although 8% to 10% have 
some degree of muscle weakness. In addition, serum CK 
exceeds the 95th percentile in approximately two thirds of 
heterozygotes. 

Until the gene responsible for DMD was identified in 1986, 
little was known about the mechanism responsible for muscle 
deterioration. The DMD gene covers approximately 2.5 Mb of 
DNA, making it by far the largest gene known in the human. It 
contains 79 exons that produce a 14-kb mRNA transcript. 
Because of this gene's huge size, transcription of an mRNA 
molecule can take as long as 24 hours. The protein product, 
named dystrophin, was unknown before the identification of 
DMD. Dystrophin accounts for only about 0.002% of a striated 
muscle cell's protein mass and is localized on the cytoplasmic 
side of the cell membrane. Its amino terminus binds F-actin, a 
key cytoskeletal protein. The carboxyl terminus of dystrophin 
binds a complex of glycoproteins, known as the dystroglycan- 
sarcoglycan complex, that spans the cell membrane and binds 
to extracellular proteins. Dystrophin thus links these two 
cellular components and plays a key role In maintaining the 
structural integrity of the muscle cell. Lacking dystrophin, 
the muscle cells of the DMD patient gradually die as they are 
stressed by muscle contractions. 

The large size of DMD helps to explain its high mutation rate, 
about 10^ per locus per generation. As with the gene respon¬ 
sible for neurofibromatosis type 1, the DMD gene presents a 
large target for mutation. A slightly altered form of the DMD 
gene product is normally found in brain cells, in which the 


transcription initiation site is farther downstream in the gene 
and a different promoter is used. Thus, the mRNA transcript 
and the resulting gene product differ from the gene product 
found in muscle cells. Dystrophin's expression in the brain 
helps to explain why approximately 25% of DMD patients have 
some degree of intellectual disability. Several additional pro¬ 
moters have also been found, providing a good example of a 
single gene that can produce different gene products as a 
result of modified transcription. 

Becker muscular dystrophy (BMD), another X-linked reces¬ 
sive dystrophic condition. Is less severe than the Duchenne 
form. The progression is also much slower, with onset at 11 
years of age, on average. One study showed that whereas 
95% of DMD patients are confined to a wheelchair before 12 
years of age, 95% of those with BMD become wheelchair 
bound after 12 years of age. Some never lose their ability to 
walk. BMD is less common than DMD, affecting about 1 in 
18,000 male births. 

For some time it was unclear whether BMD and DMD are 
caused by distinct X-linked loci or by different mutations at the 
same locus. Identification of DMD showed the latter to be the 
case. Both diseases usually result from deletions (65% of DMD 
cases and 85% of BMD cases) or duplications (6% to 7% of 
DMD and BMD cases) in DMD. But, whereas the great major¬ 
ity of DMD-causing deletions and duplications produce frame- 
shifts, the majority of BMD-causing mutations are in-frame 
alterations (i.e., a multiple of three bases is deleted or dupli¬ 
cated). One would expect that a frameshift, which Is likely to 
produce a premature stop codon (see Chapter 3) and no protein 
product, would produce more-severe disease than would an 
In-frame alteration. 

The consequences of these different mutations can be 
observed in the gene product. Although dystrophin is absent 
in almost all DMD patients. It is usually present in reduced 
quantity (or as a shortened form of the protein) in BMD patients. 
Thus, a dystrophin assay can help to distinguish between 
the two diseases. This assay also helps to distinguish both 
diseases from other forms of muscular dystrophy, because 
several of these forms (e.g., various limb-girdle muscular dys¬ 
trophies) result from mutations in genes that encode proteins 
of the dystroglycan-sarcoglycan complex, whereas dystrophin 
appears to be affected only In BMD and DMD (Figs. 5-5, 5-6, 
and 5-7). 

Genetic testing (see Chapter 13) is now commonly used to 
help diagnose DMD and BMD; a disease-causing mutation is 
identified in about 95% of cases. Considerable research has 
been done on genetically based therapies for muscular dystro¬ 
phy. For example, small-molecule drugs can cause ribosomes 
to read through premature stop codons and have shown 
promise In mouse models. Antisense oligonucleotides (see 
Chapter 13) can induce the skipping of the altered DMD exon 
during mRNA translation. These approaches, along with gene 
therapy in which a normal copy of DMD is inserted into muscle 
cells, are now being tested In clinical trials. 
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p-Sarcoglycan • 


Cytoskeleton 

FIG 5-7 The amino terminus of the dystrophin protein binds to F-actin in the cell's cytoskeleton, 
and its carboxyl terminus binds to elements of the dystroglycan-sarcoglycan complex. The latter 
complex of glycoproteins spans the cell membrane and binds to proteins in the extracellular 
matrix, such as laminin. 


CLINICAL COMMENTARY 5-2 
Duchenne Muscular Dystrophy—cont'd 


Dystrophin 


FIG 5-6 Transverse section of gastrocnemius muscle from 
(A) a normal boy and (B) a boy with Duchenne muscular 
dystrophy. Normal muscle fiber is replaced with fat and 
connective tissue. 


FIG 5-5 A patient with late-stage Duchenne muscular dys¬ 
trophy, showing severe muscle loss. 
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BOX 5-1 Color Vision: Molecular Biology and Evolution 


Human vision depends on a system of retinal photoreceptor 
cells, about 95% of which are rod cells. They contain the light¬ 
absorbing protein rhodopsin and allow us to see in conditions 
of dim light. In addition, the retina contains three classes of 
cone cells, which contain light-absorbing proteins (opsins) that 
react to light wavelengths corresponding to the three primary 
colors—red, green, and blue. Color vision depends on the pres¬ 
ence of all four of these cell types. Because three major colors 
are Involved, normal color vision is said to be trichromatic. 

There are many recognized defects of human color vision. 
The most common of these involve red and green color percep¬ 
tion and have been known since 1911 to be inherited in X-linked 
recessive fashion. Thus, they are much more common in males 
than in females. Various forms of red-green colorblindness are 
seen in about 8% of European males, 4% to 5% of Aslan 
males, and 1 % to 4% of African and Native American males. 
Among European males, 2% are dichromatic: they are unable 
to perceive one of the primary colors, usually red or green. The 
Inability to perceive green is termed deuteranopia, and the 
inability to perceive red is termed protanopia. About 6% of 
European males can detect green and red, but with altered 
perception of the relative shades of these colors. These are 
respectively termed deuteranomalous and protanomalous 
conditions. 



AVflCACO 




AVflOlco 


A, Image perceived by a person with normal color vision. 

B, The predicted perception by a person with protanopia, a 
form of red-green colorblindness. (Copyright George V. 
Kelvin.) 


It should be apparent that dichromats are not really color 
blind, because they can still perceive a fairly large array of dif¬ 
ferent colors. True colorblindness (monochromacy, the ability 
to perceive only one color) is much less common, affecting 
approximately 1 in 100,000 persons. There are two major 
forms of monochromatic vision. Rod monochromacy is an 
autosomal recessive condition in which all visual function is 
carried out by rod cells. Blue cone monochromacy is an 
X-linked recessive condition in which both the red and green 
cone cells are absent. 

Cloning of the genes responsible for color perception has 
revealed a number of interesting facts about both the biology 
and the evolution of color vision in humans. In the 1980s, 
Jeremy Nathans and colleagues reasoned that the opsins in all 
four types of photoreceptor cells might have similar amino acid 
sequences because they carry out similar functions. Thus, the 
DNA sequences of the genes encoding these proteins should 
also be similar. But none of these genes had been located, and 
the precise nature of the protein products was unknown. How 
could they locate these genes? 

Fortunately, the gene encoding rhodopsin in cattle had 
been cloned. Even though humans and cattle are separated by 
millions of years of evolution, their rhodopsin proteins still 
share about 40% of the same amino acid sequence. Thus, the 
cattle (bovine) rhodopsin gene could be used as a probe to 
search for a similar DNA sequence in the human genome. A 
portion of the bovine rhodopsin gene was converted to single¬ 
strand form, radioactively labeled, and hybridized with human 
DNA (much in the same way that a probe is used in Southern 
blotting [see Chapter 3]). Low-stringency hybridization condi¬ 
tions were used: Temperature and other conditions were 
manipulated so that complementary base pairing would occur 
despite some sequence differences between the two species. 
In this way, the human rhodopsin gene was identified and 
mapped to chromosome 3. 

The next step was to use the human rhodopsin gene as a 
probe to identify the cone-cell opsin genes. Each of the cone¬ 
cell opsin amino acid sequences shares 40% to 45% similarity 
with the human rhodopsin amino acid sequence. By probing 
with the rhodopsin gene, the gene for blue-sensitive opsin was 
identified and mapped to chromosome 7. This gene was 
expected to map to an autosome because variants in blue 
sensitivity are inherited in autosomal recessive fashion. The 
genes for the red- and green-sensitive opsins were also Identi¬ 
fied in this way and, as expected, were found to be on the X 
chromosome. The red and green genes are highly similar, 
sharing 98% of their DNA sequence. 

Initially, many Investigators expected that people with color 
vision defects would display the usual array of deletions and 
missense and nonsense mutations seen in other disorders. But 
further study revealed some surprises. It was found that the 
red and green opsin genes are located directly adjacent to each 
other on the distal long arm of the X chromosome and that 
normal persons have one copy of the red gene but can have 
one to several copies of the green gene. The multiple green 
genes are 99.9% identical in DNA sequence, and the presence 
of multiple copies of these genes has no effect on color per¬ 
ception because only the red gene and the first green gene are 
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BOX 5-1 Color Vision: Molecular Biology and Evolution—cont'd 


expressed in the retina. However, when there are no green 
genes, deuteranopia is produced. Persons who lack the single 
red gene have protanopia. 

The unique aspect of these deletions is that they are the 
result of unequal crossover during melosis. Unlike ordinary 
crossover, in which equal segments of chromosomes are 
exchanged (see Chapter 2), unequal crossover results in a loss 
of chromosome material on one chromosome homolog and a 
gain of material on the other. Unequal crossover seems to be 


facilitated by the high similarity in DMA sequence among the 
red and green genes: It Is relatively easy for the cellular machin¬ 
ery to make a mistake in deciding where the crossover should 
occur. Thus, a female with one red gene and two green genes 
could produce one gamete containing a red gene with one 
green gene and another gamete containing a red gene with 
three green genes. Unequal crossover could also result in 
gametes with no copies of a gene, producing protanopia or 
deuteranopia. 



A, Normal individuals have one red gene and one to several green genes. B, Unequal crossover 
causes normal variation in the number of green genes. C, Unequal crossover can produce a 
green dichromat with no green genes (deuteranopia). D, Unequal crossover that occurs within 
the red and green genes can produce a red dichromat (protanopia) or a green anomalous trichro¬ 
mat (deuteranomaly). E, Crossovers within the red and green genes can also produce red 
anomalous trichromats (protanomaly). The degree of red and green color perception depends on 
where the crossover occurs within the genes. (Based on Nathans J, Merbs SL, Sung C, et al. 
The genes for color vision. Sci Am. 1989;260:42-49.) 


Continued 
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BOX 5-1 Color Vision: Molecular Biology and Evolution—cont'd 


Unequal crossover also explains protanomalous and deuter- 
anomalous color vision. Here, crossover takes place within the 
red or green genes, resulting in new chromosomes with hybrid 
genes (e.g., a portion of the red gene fused with a portion of 
the green gene). The relative ratio of red and green compo¬ 
nents of these fusion genes determines the extent and nature 
of the red-green anomaly. 

Because the opsin genes have DMA sequence similarity and 
perform similar functions, they are members of a gene family, 
much like the globin genes (see Chapter 3). This suggests that 
they evolved from a single ancestral gene that, through time, 
duplicated and diverged to encode different but related pro¬ 
teins. Evidence for this process is provided by comparing these 
genes in humans and other species. Because the X-linked red 
and green opsin genes share the greatest degree of DNA 


sequence similarity, we would expect that these two genes 
would be the result of the most recent duplication. Indeed, 
humans share all four of their opsin genes with apes and Old 
World monkeys, but the less closely related New World 
monkeys have only a single opsin gene on their X chromo¬ 
somes. It is therefore likely that the red-green duplication 
occurred sometime after the split of the New and Old World 
monkeys, which took place about 30 to 40 million years ago. 
Similar comparisons date the split of the X-IInked and autoso¬ 
mal cone opsin genes to approximately 500 million years ago. 
And finally, comparisons with the fruit fly. Drosophila melano- 
gaster, Indicate that the duplication that produced the rod and 
cone visual pigment genes may have occurred as much as 
1 billion years ago. 


TABLE 5-1 Additional Examples of X-Linked Recessive Disorders 

NAME 

GENE 

CLINICAL CHARACTERISTICS 

Juvenile retlnoschisls 

RSI 

Progressive visual impairment caused by splitting of the nerve fiber layer of 
the retina: begins in the first or second decade of life; impairment typically 
20/60 to 20/120 

Lerl-Welll dyschondrosls 

SHOX 

Madelung deformity of the radius and ulna; mesomelia (shortening of the 
forearms and lower legs); short stature 

ATR-X 

ATRX 

Mental retardation; genital anomalies; and a-thalassemia without abnormalities 
of the a-globin gene complex 

Hypohydrotlc ectodermal hypoplasia 

EDA 

Diminished sweating and heat intolerance; sparse and light-colored hair, 
eyelashes, and eyebrows; abnormal and/or missing teeth; recurrent upper 
airway infections 

Vitamin D-resistant rickets 

PHEX 

Hypophosphatemia due to reduced renal phosphate reabsorption; short 
stature; bowed legs; poor teeth formation 

Aarskog-Scott syndrome 
(faciogenital dysplasia) 

FGD1 

Short stature: hypertelorism; genital anomalies 

Cleft palate with ankyloglossia 

TBX22 

Cleft palate with or without ankyloglossia (tongue tie) 

Pelizaeus-Merzbacher disease 

PLPl 

Defect of myelination; typically manifests in infancy or early childhood; 
characterized by nystagmus, hypotonia, spasticity, early death 

Nephrogenic diabetes insipidus 

AVPR2 

Impaired response to antidiuretic hormone that leads to inability to 
concentrate urine, polydipsia (excessive thirst), polyuria (excessive urine 
production) 

Otopalatodigital spectrum disorders 

FLNA 

Skeletal dysplasia ranging from mild to lethal; males more severely affected 
than females 


An X-linked recessive disease with gene frequency q will 
be seen in a fraction q of males. This is because a male, having 
only one X chromosome, will manifest the disease if his X 
chromosome contains the disease-causing mutation. Females, 
needing two copies of the mutant allele to express the disease, 
will have a disease frequency of only q^, as in autosomal reces¬ 
sive diseases. For example, hemophilia A (see Clinical Com¬ 
mentary 5-1) is seen in about 1 of every 10,000 males in some 
populations. Thus, in a collection of 10,000 male X chromo¬ 
somes, one chromosome would, on average, contain the 
disease-causing mutation {q = 0.0001). Affected female 
homozygotes are almost never seen, because q^ = 0.00000001, 
or 1 in 100,000,000. This example shows that, in general, 
males are more frequently affected with X-linked recessive 


diseases than are females, with this difference becoming more 
pronounced as the disease becomes rarer. 

Because females have two copies of the X 
chromosome and males have only one (hemi- 
zygosity), X-linked recessive diseases are 
much more common among males than 
among females. 

In addition to their higher prevalence in males than 
females, X-linked recessive diseases display several patterns of 
inheritance that distinguish them from autosomal dominant 
and recessive diseases (Fig. 5-8). Because a father can transmit 
only a Y chromosome to his son, X-linked genes are not 
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FIG 5-8 A pedigree showing the inheri¬ 
tance of an X-linked recessive trait. Solid 
symbols represent affected individuals, 
and dotted symbols represent heterozy¬ 
gous carriers. 
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FIG 5-9 Punnett square representation of the mating of a 
heterozygous female who carries an X-linked recessive 
disease gene with a normal male. X,, Chromosome with 
normal allele; X 2 , chromosome with disease allele. 


FIG 5-10 Punnett square representation of the mating of a 
normal female with a male who is affected by an X-linked 
recessive disease. X",, Chromosome with normal allele; 
X 2 , chromosome with disease allele. 


passed from father to son. (In contrast, father-to-son trans¬ 
mission can be observed for autosomal disease alleles.) An 
X-linked disease allele can be transmitted through a series of 
phenotypically normal heterozygous females, causing the 
appearance of skipped generations. The gene is passed from 
an affected father to all of his daughters, who, as carriers, 
transmit it to approximately half of their sons, who are 
affected. 

X-linked recessive inheritance is characterized 
by an absence of father-to-son transmission, 
skipped generations when genes are passed 
through female carriers, and a preponderance 
of affected males. 

The most common mating type involving X-linked reces¬ 
sive genes is the combination of a carrier female and a normal 
male. The carrier mother will transmit the disease gene to 
half of her sons and half of her daughters, on average. As 
Figure 5-9 shows, approximately half of the daughters of such 


a mating will be carriers, and half will be normal. Half of the 
sons will be normal, and half, on average, will have the disease. 

The other common mating type is an affected father and 
a homozygous unaffected mother (Fig. 5-10). Here, all of the 
sons must be unaffected, because the father can transmit only 
his Y chromosome to them. Because all of the daughters must 
receive the father’s X chromosome, they will all be heterozy¬ 
gous carriers. None of the children will manifest the disease, 
however. Because the father must transmit his X chromosome 
to his daughters and cannot transmit it to his sons, these risks, 
unlike those in the previous paragraph, are exact figures 
rather than probability estimates. 

A much less common mating type is that of an affected 
father and a carrier mother (Fig. 5-11). Half of the daughters 
will be heterozygous carriers, and half, on average, will be 
homozygous for the disease gene and thus affected. Half of 
the sons will be unaffected, and half will be affected. It may 
appear that father-to-son transmission of the disease has 
occurred, but the affected son has actually received the disease 
allele from his mother. 
















































90 


CHAPTER 5 Sex-Linked and Nontraditional Modes of Inheritance 


Mother 



Daughters: 
50% affected, 
50% carriers 


Sons: 

50% normal, 
50% affected 


FIG 5-11 Punnett square representation of the mating of a 
carrier female with a male affected with an X-linked recessive 


disease. Xi, Chromosome with normal allele; X 2 , chromo¬ 
some with disease allele. 



Recurrence risks for X-linked recessive disor¬ 
ders are more complex than for autosomal 
disorders. The risk depends on the genotype 
of each parent and the sex of the offspring. 


Occasionally, females who inherit only a single copy of an 
X-linked recessive disease allele can be affected with the 
disease. Imagine a female embryo that has received a normal 
clotting factor Vlll allele from one parent and a mutated 
clotting factor Vlll allele from the other. Ordinarily, X inac¬ 
tivation will result in approximately equal numbers of cells 
having active paternal and maternal X chromosomes. In this 
case, the female carrier would produce about 50% of the 
normal level of factor VIII and would be phenotypically 
normal. However, because X inactivation is a random process, 
it sometimes results in a heterozygous female in whom nearly 
all of the active X chromosomes happen to be the ones car¬ 
rying the disease-causing mutation. About 5% of female het¬ 
erozygotes exhibit hemophilia A and are termed manifesting 
heterozygotes. Because they usually maintain at least a small 
fraction of active normal X chromosomes, manifesting het¬ 
erozygotes tend to be relatively mildly affected. 



Because X inactivation is a random process, 
some female heterozygotes experience inacti¬ 
vation of most of the normal X chromosomes 
in their cells. These manifesting heterozygotes 
are usually mildly affected. 


Less commonly, females having only a single X chromo¬ 
some (Turner syndrome) have been seen with X-linked reces¬ 
sive diseases such as hemophilia A. Females can also be 
affected with X-linked recessive diseases as a result of trans¬ 
locations or deletions of X chromosome material (see Chapter 
6). These events are rare. 


X-Linked Dominant Inheritance 

X-linked dominant diseases are fewer and less prevalent than 
are X-linked recessive diseases. An example of an X-linked 
dominant disease is hypophosphatemic rickets, a disease in 
which the kidneys are impaired in their ability to reabsorb 
phosphate. This results in abnormal ossification, with bending 
and distortion of the bones (rickets). Another example is 
incontinentia pigmenti type 1, a disorder characterized by 
abnormal skin pigmentation, conical or missing teeth, and 
ocular and, in some cases, neurological abnormalities. This 
disorder is seen only in females. It is thought that hemizygous 
males are so severely affected that they do not survive to term. 
Heterozygous females, having one normal X chromosome, 
tend generally to have milder expression of X-linked 
dominant traits (just as heterozygotes for most autosomal 
dominant disease genes are less severely affected than are 
homozygotes). 

X-linked dominant inheritance is also observed in Rett 
syndrome, a neurodevelopmental disorder seen in 1 in 10,000 
females and in a much smaller fraction of males, most of 
whom do not survive to term. Rett syndrome is characterized 
by autistic behavior, intellectual disability, seizures, and gait 
ataxia. The severity of this condition varies substantially 
among affected females, reflecting the effects of random X 
inactivation. In mildly affected females, a large percentage of 
the X chromosomes that bear the disease-causing mutation 
have been randomly inactivated. About 95% of classic Rett 
syndrome cases are caused by mutations in the MECP2 gene, 
and most of these mutations are de novo events that occur in 
the paternal germline (consistent with a higher mutation rate 
in male gamete formation, discussed in Chapter 3). The 
protein product encoded by MECP2 binds to methylated CG 
sequences found in the 5' regions of other genes. After binding 
to these sequences, the protein helps to recruit other proteins 
that repress transcription by causing chromatin condensa¬ 
tion. Loss-of-function mutations in MECP2 result in the 
inappropriate expression of genes thought to be involved in 
brain development. 

Figure 5-12 illustrates a pedigree for X-linked dominant 
inheritance. As with autosomal dominant diseases, a person 
need inherit only a single copy of an X-linked dominant 
disease gene to manifest the disorder. Because females have 
two X chromosomes, either of which can potentially carry the 
disease gene, they are about twice as commonly affected as 
males (unless the disorder is lethal in males, as in incontinen¬ 
tia pigmenti). Affected fathers cannot transmit the trait to 
their sons. All of their daughters must inherit the disease 
gene, so all are affected. Affected females are usually hetero¬ 
zygotes and thus have a 50% chance of passing the disease 
allele to their daughters and sons. The characteristics of 
X-linked dominant and X-linked recessive inheritance are 
summarized in Table 5-2. As already mentioned, the distinc¬ 
tion between these categories can be blurred by incomplete 
penetrance in heterozygotes for X-linked dominant muta¬ 
tions and by the presence of disease in heterozygotes for 
X-linked recessive mutations (manifesting heterozygotes). 
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X-linked dominant diseases display character¬ 
istic patterns of inheritance. They are about 
twice as common in females as in males, 
skipped generations are uncommon, and 
father-to-son transmission is not seen. 

Y-Linked Inheritance 

Although it consists of approximately 60 Mb of DNA, the Y 
chromosome contains relatively few genes. Only a few dozen 
Y-linked, or holandric, genes have been identified. These 
include the gene that initiates differentiation of the embryo 
into a male (see Chapter 6), several genes that encode testis- 
specific spermatogenesis factors, a minor histocompatibility 
antigen (termed HY), and a gene in which mutations can 
cause hearing loss (DPNYl). Several housekeeping genes are 



XiY XiXg XiY X1X2 X2Y XiY X1X1 

FIG 5-12 Pedigree demonstrating the inheritance of an 
X-linked dominant trait. X,, Chromosome with normal allele; 
X 2 , chromosome with disease allele. 


located on the Y chromosome, and they all have inactivation- 
escaping homologs on the X chromosome. Transmission of 
Y-linked traits is strictly from father to son (Fig. 5-13). 

SEX-LIMITED AND SEX-INFLUENCED TRAITS 

Confusion sometimes exists regarding traits that are sex- 
linked and those that are sex-limited or sex-influenced. A 
sex-limited trait occurs in only one of the sexes—due, for 
instance, to anatomical differences. Inherited uterine or tes¬ 
ticular defects would be examples. A good example of a sex- 
influenced trait is male-pattern baldness, which occurs in 
both males and females but much more commonly in males. 
This is related in part to sex differences in hormone levels. 
Contrary to oft-stated belief, male-pattern baldness is not 
strictly X-linked, although variation in the X-linked andro¬ 
gen receptor gene is associated with baldness. Autosomal 
genes are also thought to influence male-pattern baldness, 
helping to explain apparent father-to-son transmission of 
this trait. 



FIG 5-13 Pedigree demonstrating the inheritance of a 
Y-linked trait. Transmission is exclusively male to male. 



TABLE 5-2 Comparison of the Major Attributes of X-Linked Dominant and X-Linked 
Recessive Inheritance Patterns 

ATTRIBUTE 

X-LINKED DOMINANT 

X-LINKED RECESSIVE 

Recurrence risk for heterozygous 

50% of sons affected: 50% of daughters 

50% of sons affected; 50% of daughters 

female x normal male mating 

affected 

heterozygous carriers 

Recurrence risk for affected male 

0% of sons affected: 100% of daughters 

0% of sons affected; 100% of daughters 

X normal female mating 

affected 

heterozygous carriers 

Transmission pattern 

Vertical; disease phenotype seen in 
generation after generation 

Skipped generations may be seen, 
representing transmission through carrier 
females 

Sex ratio 

Twice as many affected females as affected 
males (unless disease is lethal in males) 

Much greater prevalence of affected males; 
affected homozygous females are rare 

Other 

Male-to-male transmission is not seen; 
expression is less severe in female 
heterozygotes than in affected males 

Male-to-male transmission not seen; 
manifesting heterozygotes may be seen 
in females 


Compare with the inheritance patterns for autosomal diseases shown in Table 4-1. 
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MITOCHONDRIAL INHERITANCE 

The great majority of genetic diseases are caused by altera¬ 
tions in the nuclear genome. However, a small but significant 
number of diseases can be caused by mutations in mitochon¬ 
drial DNA. Because of the unique properties of the mito¬ 
chondria, these diseases display characteristic modes of 
inheritance and a large degree of phenotypic variability. 

Each human cell contains several hundred or more mito¬ 
chondria in its cytoplasm. Through the complex process of 
oxidative phosphorylation, mitochondria produce adenosine 
triphosphate (ATP), the energy source essential for cellular 
metabolism. Mitochondria are thus critically important for 
cell survival. 

The mitochondria have their own DNA molecules, which 
occur in several copies per mitochondrial body and consist 
of 16,569 base pairs arranged on a double-stranded circular 
molecule (Fig. 5-14). The mitochondrial genome encodes 

\ D loop / 

) region ( 



FIG 5-14 The circular mitochondrial DNA genome. Locations 
of protein-encoding genes (for reduced nicotinamide adenine 
dinucleotide [NADH] dehydrogenase, cytochrome c oxidase, 
cytochrome c oxidoreductase, and adenosine triphosphate 
[ATP] synthase) are shown, as are the locations of the two 
ribosomal RNA genes and 22 transfer RNA genes (desig¬ 
nated by single letters). The replication origins of the heavy 
(OH) and light (OL) chains and the noncoding D loop (also 
known as the control region) are shown. (Modified from 
MITOMAP. A Human Mitochondrial Genome Database. 
http://www.mitomap.org, 2008.) 


two ribosomal RNAs (rRNAs), 22 transfer RNAs (tRNAs), 
and 13 polypeptides involved in oxidative phosphorylation. 
(Approximately 1000 nuclear DNA genes also encode poly¬ 
peptides that are transported into the mitochondria.) Tran¬ 
scription of mitochondrial DNA (mtDNA) takes place in the 
mitochondrion, independently of the nucleus. Unlike nuclear 
genes, mtDNA genes contain no introns. Because it is located 
in the cytoplasm, mtDNA is inherited exclusively through the 
maternal line (Fig. 5-15). Males do not transmit mtDNA 
to their offspring because sperm cells contain only a small 
number of mtDNA molecules, which are not incorporated 
into the developing embryo. (One isolated case of paternal 
transmission of a mitochondrial DNA mutation has been 
reported, but such events appear to be extremely rare.) 

The mutation rate of mtDNA is about 10 times higher 
than that of nuclear DNA. This is caused by a relative lack of 
DNA repair mechanisms in the mtDNA and also by damage 
from free oxygen radicals released during the oxidative phos¬ 
phorylation process. 

Because each cell contains a population of mtDNA mole¬ 
cules, a single cell can harbor some molecules that have an 
mtDNA mutation and other molecules that do not. This het¬ 
erogeneity in DNA composition, termed heteroplasmy, is an 
important cause of variable expression in mitochondrial 
diseases. The larger the percentage of mutant mtDNA mol¬ 
ecules, the more severe the expression of the disease. As cells 
divide, changes in the percentage of mutant alleles can occur 
through chance variation (identical in concept to genetic 
drift, discussed in Chapter 3) or because of a selective advan¬ 
tage (e.g., deletions produce a shorter mitochondrial DNA 
molecule that can replicate more quickly than a full-length 
molecule). 

Each tissue type requires a certain amount of ATP for 
normal function. Although some variation in ATP levels may 
be tolerated, there is typically a threshold level below which 
cells begin to degenerate and die. Organ systems with large 
ATP requirements and high thresholds tend to be the ones 
most seriously affected by mitochondrial diseases. For 
example, the central nervous system consumes about 20% of 
the body’s ATP production and therefore is often affected by 
mtDNA mutations. 

Like the globin disorders, mitochondrial disorders can be 
classified according to the type of mutation that causes them. 
Missense mutations in protein-coding mtDNA genes cause 
one of the best-known mtDNA diseases, Leber hereditary 
optic neuropathy (LHON). This disease, which affects about 
one in 10,000 persons, is characterized by rapid loss of vision 
in the central visual field as a result of optic nerve death. 
Vision loss typically begins in the third decade of life and is 
usually irreversible. Heteroplasmy is minimal in LHON, so 
expression tends to be relatively uniform and pedigrees for 
this disorder usually display a clear pattern of mitochondrial 
inheritance. 

Single-base mutations in a tRNA gene can result in myo¬ 
clonic epilepsy with ragged-red fiber syndrome (MERRF), a 
disorder characterized by epilepsy, dementia, ataxia (uncoor¬ 
dinated muscle movement), and myopathy (muscle disease). 
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FIG 5-15 A pedigree showing the inheritance of a 
disease caused by a mitochondrial DNA mutation. 
Only females can transmit the disease mutation to 
their offspring. Complete penetrance of the disease- 
causing mutation is shown in this pedigree, but 
heteroplasmy often results in incomplete pene¬ 
trance for mitochondrial diseases. 


MERRF is characterized by heteroplasmic mtDNA and is thus 
highly variable in its expression. Another example of a mito¬ 
chondrial disease caused by a single-base tRNA mutation is 
mitochondrial encephalomyopathy and stroke-like episodes 
(MELAS). Like MERRF, MELAS is heteroplasmic and highly 
variable in expression. 

The final class of mtDNA mutations consists of duplica¬ 
tions and deletions. These can produce Kearns-Sayre disease 
(muscle weakness, cerebellar damage, and heart failure); 
Pearson syndrome (infantile pancreatic insufficiency, pancy¬ 
topenia, and lactic acidosis); and chronic progressive external 
ophthalmoplegia (CPEO). Hundreds of disease-causing 
mtDNA mutations, including single-base mutations, dele¬ 
tions, and duplications, have been reported. 

Mitochondrial mutations are also associated with some 
common human diseases. A mitochondrial mutation causes 
a form of late-onset deafness, and the MELAS mutation is 
seen in 1% to 2% of persons with type 2 diabetes mellitus. 
Mitochondrial alterations may also be associated with some 
cases of Alzheimer disease, although it remains unclear 
whether mitochondrial mutations are a primary cause or a 
secondary event. It has also been suggested that mtDNA 
mutations, which accumulate through the life of an individ¬ 
ual as a result of free radical formation, could contribute to 
the aging process. 

The mitochondria, which produce ATP, have 
their own unique DNA. Mitochondrial DNA is 
maternally inherited and has a high mutation 
rate. A number of diseases are known to be 
caused by mutations in mitochondrial DNA. 

GENOMIC IMPRINTING 

Mendel’s experimental work with garden peas established 
that the phenotype is the same whether a given allele is inher¬ 
ited from the mother or the father. Indeed, this principle has 
long been part of the central dogma of genetics. However, 
research during the past two decades demonstrates that this 
principle does not always hold. For some human genes, one 
of the alleles is transcriptionally inactive (no mRNA is pro¬ 
duced), depending upon the parent from whom the allele was 


received. For example, an allele transmitted by the mother 
would be inactive, and the same allele transmitted by the 
father would be active. The normal individual would have 
only one transcriptionally active copy of the gene. This 
process of gene silencing is known as imprinting, and the 
transcriptionally silenced genes are said to be imprinted. At 
least 100 human genes are known to be imprinted, and most 
of these are located in genomic regions that contain clusters 
of several or more imprinted genes. 

Imprinted alleles tend to be heavily methylated (in con¬ 
trast to the nonimprinted copy of the allele, which typically 
is not methylated). The attachment of methyl groups to 5' 
regions of genes, along with histone hypoacetylation and 
condensation of chromatin, inhibit the binding of proteins 
that promote transcription. It should be apparent that this 
process is similar in many ways to X inactivation, discussed 
earlier in this chapter. 

Prader-Willi and Angelman Syndromes 

A striking disease example of imprinting results from a dele¬ 
tion of about 4 Mb of the long arm of chromosome 15. When 
this deletion is inherited from the father, the child manifests 
Prader-Willi syndrome (PWS), a disease whose features 
include short stature, hypotonia (poor muscle tone), small 
hands and feet, obesity, mild to moderate intellectual dis¬ 
ability, and hypogonadism (Fig. 5-16A). When the same 
deletion is inherited from the mother, the child develops 
Angelman syndrome (AS), which is characterized by severe 
intellectual disability, seizures, and an ataxic gait (see Fig. 
5-16B). Both diseases are seen in about 1 of every 15,000 
persons, and chromosome deletions are responsible for about 
70% of cases of both diseases. The deletions that cause PWS 
and AS are microscopically indistinguishable and affect the 
same group of genes. 

For some time, it was unclear how the same chromosomal 
deletion could produce such disparate results in different 
patients. Further analysis showed that the 4-Mb deletion (the 
critical region) contains several genes that normally are tran¬ 
scribed only on the chromosome inherited from the father. 
These genes are transcriptionally inactive (imprinted) on the 
copy of chromosome 15 inherited from the mother. Similarly, 
other genes in the critical region are active only on the 
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FIG 5-16 Illustration of the effect of imprinting on chromosome 15 deletions. A, Inheritance of 
the deletion from the father produces Prader-Willi syndrome (PWS) (note the inverted V-shaped 
upper lip, small hands, and truncal obesity). B, Inheritance of the deletion from the mother pro¬ 
duces Angelman syndrome (note the characteristic posture). C, Pedigrees illustrating the inheri¬ 
tance pattern of this deletion and the activation status of genes in the critical region. 


chromosome inherited from the mother and are inactive on 
the chromosome inherited from the father. Thus, several 
genes in this region are normally active on only one chromo¬ 
some (see Fig. 5-16C). If the single active copy of one of these 
genes is lost through a chromosome deletion, then no gene 
product is produced at all, and disease results. 

Molecular analysis using many of the tools and techniques 
outlined in Chapter 3 has identified several specific genes in 
the critical region of chromosome 15. The gene responsible 
for AS encodes a protein involved in ubiquitin-mediated 


protein degradation during brain development (consistent 
with the intellectual disability and ataxia observed in this 
disorder). In brain tissue, this gene is active only on the chro¬ 
mosome inherited from the mother; thus, a maternally trans¬ 
mitted deletion removes the single active copy of this gene. 
Several genes in the critical region are involved in PWS, and 
they are transcribed only on the chromosome transmitted by 
the father. 

Several mechanisms in addition to chromosome deletions 
can cause PWS and AS. One of these is uniparental disomy. 
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in which an individual inherits two copies of a chromosome 
from one parent and none from the other (see Chapter 6 
for further discussion). When two copies of the maternal 
chromosome 15 are inherited, PWS results because no 
active paternal genes are present in the critical region. Con¬ 
versely, disomy of the paternal chromosome 15 produces AS. 
Point mutations in the maternally transmitted copy of the 
AS gene produce approximately 10% to 20% of AS cases. 
Finally, a small percentage of PWS and AS result from 
defects in the imprinting control center on chromosome 15. 
This is the DNA sequence that helps to set and reset the 
imprint itself The great majority of PWS and AS cases can 
be diagnosed accurately by analyzing methylation patterns 
and DNA sequences in the critical region. Box 5-2 presents 
clinical issues of PWS from the perspective of a patient’s 
family. 

Beckwith-Wiedemann Syndrome 

A second example of imprinting in the human genome is 
given by Beckwith-Wiedemann syndrome, an overgrowth 
condition accompanied by an increased predisposition to 
cancer. Beckwith-Wiedemann syndrome is usually recogniz¬ 
able at birth because of large size for gestational age, neonatal 
hypoglycemia, a large tongue, creases on the ear lobe, and 
omphalocele (an abdominal wall defect). Some children with 
Beckwith-Wiedemann syndrome also develop asymmetrical 
overgrowth of a limb or one side of the face or trunk (i.e., 
hemihyperplasia). Children with Beckwith-Wiedemann syn¬ 
drome have an increased risk for developing Wilms tumor (a 
kidney cancer) and hepatoblastoma (a liver cancer). Both of 
these tumors can be treated effectively if they are detected 
early, so screening at regular intervals is an important part of 
management (see Chapter 15). 

As with Angelman syndrome, a minority of Beckwith- 
Wiedemann syndrome cases (about 20-30%) are caused by 
the inheritance of two copies of a chromosome from the 
father and no copy of the chromosome from the mother 
(uniparental disomy, in this case affecting chromosome 11). 
Several genes on the short arm of chromosome 11 are 
imprinted on either the paternal or maternal chromosome 
(Fig. 5-17). These genes are found in two separate, differen¬ 
tially methylated regions (DMRs). In DMRl, the gene that 
encodes insulin-like growth factor 2 (IGF2) is normally inac¬ 
tive on the maternally transmitted chromosome and active 
on the paternally transmitted chromosome. Normally, then, 
a person has only one active copy of IGF2. When two copies 
of the paternal chromosome are inherited (i.e., paternal uni¬ 
parental disomy) or there is loss of imprinting on the mater¬ 
nal copy of IGF2, an active IGF2 gene is present in double 
dose. This results in increased levels of 1GF2 during fetal 
development, contributing to the overgrowth features of 
Beckwith-Wiedemann syndrome. (Note that, in contrast to 
PWS and Angelman syndromes, which are produced by a 
missing gene product, Beckwith-Wiedemann syndrome is 
caused, at least in part, by overexpression of a gene product.) 

In 50% to 60% of cases, Beckwith-Wiedemann syndrome 
is caused by a loss of methylation or other alterations 


BOX 5-2 A Mother's Perspective of 
Prader-Willi syndrome 


We have a 3X-year-old son, John, who has Prader-Willi 
syndrome. Months before John was born, we were con¬ 
cerned about his well-being because he wasn't as active in 
utero as his older siblings had been. At the first sight of John, 
the doctors suspected that things "weren't quite right." John 
opened his eyes but made no other movements. He couldn't 
adequately suck, he required supplemental oxygen, and he 
was "puffy." He remained hospitalized for nearly 3 weeks. 
The next 3 years were filled with visits to occupational thera¬ 
pists, physical therapists, home health-care aides, early child¬ 
hood service providers, and speech therapists. 

From the day John was born, we searched diligently for a 
diagnosis. His father insisted that we need only love and help 
him. However, I wanted specifics on how to help him and 
knowledge from other parents who might have traveled a 
similar path. After extensive testing and three "chromosome 
checks," John's problem was diagnosed as Prader-Willi syn¬ 
drome (PWS). We were glad to be provided with some direc¬ 
tion and decided that we would deal with further challenges 
as they came upon us. We used what we learned about PWS 
to get started helping John reach his potential. We were not 
going to worry about all the potential problems John could 
have because of his PWS. 

John attends a special education preschool at the local 
elementary school 4 days a week. The bus ride takes about 
5 minutes, but it is long enough for John to very much antici¬ 
pate it each day. If he is ill, we have to tell him that the bus 
is broken. He attends a Sunday school class with children of 
a similar age. He misbehaves by saying "hi" and "bye" very 
loudly to each participant. He receives speech therapy once 
a week, and I spend at least 30 minutes each day with John 
practicing speech, cognitive, and play skills. John has not yet 
experienced the feeding difficulties commonly observed in 
children with PWS. However, excessive eating and weight 
gain are more common in older children with PWS. 

Compared with other 3-year-oid children, John struggles 
with speech and motor developmental milestones. Yet, he 
loves to play with his siblings and their friends and to look at 
books. In fact, we struggle to keep people from doing too 
many things for John because they might prevent him from 
attaining the same goal independently. We feel very privi¬ 
leged to have him in our family. 

Our expectations for John are that he achieves everything 
that is possible for him plus a little bit more. Indeed, some 
of his care providers are already impressed with his capabili¬ 
ties. I hope that his success is partly a result of the care 
and support that we have given to him. Moreover, I hope 
that John continues to overcome the daily challenges that 
face him. 


of DMR2. This region contains several genes, including 
CDKNl C, an important growth-regulating gene, and KGNQl, 
a gene that contains a regulatory IncRNA within one of its 
introns. Altered expression levels of these genes are thought 
to result in silencing of growth inhibitors and thus over¬ 
growth and increased predisposition to cancer. 
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FIG 5-17 Schematic of the organization of several imprinted 
genes on chromosome 11 pi 5.5 that are involved in the 
pathogenesis of Beckwith-Wiedemann syndrome and Silver- 
Russell syndrome. Beckwith-Wiedemann syndrome can 
arise from loss of imprinting of the growth promoting gene, 
IGF2, on the maternally transmitted chromosome, two copies 
of the paternal allele with an active IGF2 as a consequence 
of uniparental disomy, or faulty imprinting of the growth¬ 
suppressing gene, CDKNIC, on the maternally transmitted 
chromosome. Imprinting defects that lead to down-regulation 
of IGF2 on the paternal allele cause some cases of Silver- 
Russell syndrome. DMR, Differentially methylated region; 
red, genes that are not methylated and therefore expressed; 
green, genes that are methylated and therefore silenced. 

Silver-Russell Syndrome 

Silver-Russell syndrome is a clinically heterogeneous group 
of disorders characterized by growth retardation, proportion¬ 
ate short stature, leg length discrepancy, and a small, 
triangular-shaped face. About half of Silver-Russell syndrome 
cases are associated with a loss of methylation of DMRl on 
chromosome llpl5.5, leading to down-regulation of 1GF2 
and diminished growth. Thus, whereas up-regulation or 
extra copies of active IGF2 cause overgrowth in Beckwith- 
Wiedemann syndrome, down-regulation of IGF2 causes 
diminished growth in Silver-Russell syndrome. 

Some disease genes may be expressed differ¬ 
ently when inherited from one sex versus the 
other. This is genomic imprinting. It is typically 
associated with methylation of DNA and chro¬ 
matin condensation, which limit the action 
of transcription factors and decrease gene 
expression. 

Anticipation and Repeat Expansion 

Since the early part of the 20th century, it has been observed 
that some genetic diseases seem to display an earlier age of 
onset and/or more severe expression in the more recent gen¬ 
erations of a pedigree. This pattern is termed anticipation, 
and it has been the subject of considerable controversy and 
speculation. Many researchers believed that it was an artifact 
of better observation and clinical diagnosis in more recent 
times: a disorder that previously might have remained undi¬ 
agnosed until age 60 years might now be diagnosed at age 
40 years simply because of better diagnostic tools. Others, 
however, believed that anticipation could be a real biological 
phenomenon, although evidence for the actual mechanism 
remained elusive. 



FIG 5-18 A three-generation family affected with myotonic 
dystrophy. The degree of severity increases in each genera¬ 
tion. The grandmother (right) is only slightly affected, but the 
mother (left) has a characteristic narrow face and somewhat 
limited facial expression. The baby is more severely affected 
and has the facial features of children with neonatal-onset 
myotonic dystrophy, including an open, triangle-shaped 
mouth. The infant has more than 1000 copies of the trinucleo¬ 
tide repeat, whereas the mother and grandmother each have 
approximately 100 repeats. 


Molecular genetics has now provided good evidence that 
anticipation does have a biological basis. This evidence has 
come, in part, from studies of myotonic dystrophy, an auto¬ 
somal dominant disease that involves progressive muscle 
deterioration and myotonia (inability to relax muscles after 
contraction) (Fig. 5-18). Seen in approximately 1 in 8000 
persons, myotonic dystrophy is the most common muscular 
dystrophy that affects adults. This disorder is also typically 
characterized by cardiac arrhythmias (abnormal heart 
rhythms), testicular atrophy, insulin resistance, and cataracts. 
Most cases of myotonic dystrophy (termed myotonic dystro¬ 
phy type 1) are caused by mutations in DMPK, a protein 
kinase gene located on chromosome 19. 

Analysis of DMPK has shown that the disease-causing 
mutation is an expanded CTG trinucleotide repeat (see 
Chapter 3) that lies in the 3' untranslated portion of the gene 
(i.e., a region transcribed into mRNA but not translated into 
protein). The number of these repeats is strongly correlated 
with severity of the disease. Unaffected persons typically have 
5 to 37 copies of the repeat. Those with 50 to 100 copies may 
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be mildly affected or have no symptoms. Those with full¬ 
blown myotonic dystrophy have anywhere from 100 to several 
thousand copies of the repeated sequence. Expansion to large 
repeat numbers can produce congenital myotonic dystrophy; 
for reasons that are not well understood, these large expan¬ 
sions are transmitted almost exclusively by females. The 
number of repeats often increases with succeeding genera¬ 
tions: a mildly affected parent with 80 repeats might produce 
a severely affected offspring who has more than 1000 repeats 
(Fig. 5-19). As the number of repeats increases through suc¬ 
cessive generations, the age of onset decreases and severity 
often increases. Thus, expansion of this trinucleotide repeat 
is the primary cause of anticipation in myotonic dystrophy. 

How does a mutation in an untranslated portion of DMPK 
produce the many disease features of myotonic dystrophy? 
The expanded repeat produces an mRNA product that 
remains in the nucleus of the cell and produces toxic gain-of- 
function effects. The abnormal mRNA interacts with proteins 
that normally bind other RNA products to regulate their 
splicing. As a result, a number of proteins, including several 
that are expressed in heart and skeletal muscle, are abnor¬ 
mally formed, giving rise to some of the pleiotropic features 
of the disease phenotype. 

A second form of myotonic dystrophy, type 2, is caused by 
a 4-bp (CCTG) expanded repeat in a gene located on chro¬ 
mosome 3. Again, the repeat is located in an untranslated 
region of the gene (intron 1), and it results in a toxic mRNA 
that interferes with the normal function of RNA-binding 


proteins. The phenotype associated with this mutation is 
similar to that of myotonic dystrophy type 1, although it 
sometimes is less severe. Myotonic dystrophy thus illustrates 
several important genetic principles: anticipation, pleiotropy, 
and locus heterogeneity. 

Repeat expansions have now been identified as a cause of 
more than 20 genetic diseases (Table 5-3), which can be 
assigned to three broad categories. The first includes neuro¬ 
logical diseases, such as Huntington disease and most of the 
spinocerebellar ataxias, that are caused by a CAG or CTG 
repeat expansion in a protein-coding portion of the gene. The 
repeats generally expand in number from a normal range of 
10 to 35 to a disease-causing range of approximately 50 to 
100. Expansions tend to be larger when transmitted through 
the father than through the mother, and the mutations typi¬ 
cally have a gain-of-function effect. The second group con¬ 
sists of phenotypically more diverse diseases in which the 
expansions are again small in magnitude and are found in 
exons. The repeat sequence is heterogeneous, however, and 
anticipation is not a typical feature. The third category 
includes fragile X syndrome, myotonic dystrophy, two of the 
spinocerebellar ataxias, juvenile myoclonic epilepsy, and 
Friedreich ataxia. The repeat expansions are generally much 
larger than in the first two categories: The normal range is 
usually 5 to 50 trinucleotides, but the disease-causing range 
can vary from 100 to several thousand trinucleotides. The 
repeats are located outside the protein-coding regions of 
the gene in all of these disorders, and in some cases (e.g.. 


I 



FIG 5-19 A, Myotonic dystrophy pedigree illustrating anticipation. In this case, the age of onset 
for family members affected with an autosomal dominant disease is lower In more recent gen¬ 
erations. B, An autoradiogram from a Southern blot analysis of the myotonic dystrophy gene in 
three individuals. Individual a Is homozygous for a 4- to 5-repeat allele and is normal. Individual 
b has one normal allele and one disease allele of 175 repeats; this individual has myotonic dys¬ 
trophy. Individual c Is also affected with myotonic dystrophy and has one normal allele and a 
disease-causing allele of approximately 900 repeats. (B courtesy Dr. Kenneth Ward and Dr. Elaine 
Lyon, University of Utah Health Sciences Center.) 




























98 


CHAPTER 5 Sex-Linked and Nontraditional Modes of Inheritance 


TABLE 5-3 Diseases Associated with Repeat Expansions 



DISEASE 

DESCRIPTION 

REPEAT 

SEQUENCE 

NORMAL 

RANGE; DISEASE 
RANGE 

PARENT IN WHOM 
EXPANSION 
USUALLY OCCURS 

LOCATION OF 
EXPANSION 

Category 1 

Huntington disease 

Loss of motor 
control, dementia, 
affective disorder 

CAG 

6-34; 36-121 

More often through 
father 

Exon 

Spinai and buibar 
muscular atrophy 

Adult-onset motor- 
neuron disease 
associated with 
androgen 
insensitivity 

CAG 

9-36; 38-62 

More often through 
father 

Exon 

Spinocerebellar ataxia 
type 1 

Progressive ataxia, 
dysarthria, 
dysmetria 

CAG 

6-39; 40-82 

More often through 
father 

Exon 

Spinocerebellar ataxia 
type 2 

Progressive ataxia, 
dysarthria 

CAG 

15-24; 32-200 

— 

Exon 

Spinocerebellar ataxia 
type 3 (Machado- 
Joseph disease) 

Dystonia, distal 
muscular atrophy, 
ataxia, external 
ophthalmoplegia 

CAG 

13-36; 61-84 

More often through 
father 

Exon 

Spinocerebellar ataxia 
type 6 

Progressive ataxia, 
dysarthria, 
nystagmus 

CAG 

4-19; 20-33 


Exon 

Spinocerebellar ataxia 
type 7 

Progressive ataxia, 
dysarthria, retinal 
degeneration 

CAG 

4-35; 37-306 

More often through 
father 


Spinocerebellar ataxia 
type 17 

Progressive ataxia, 
dementia, 
bradykinesia, 
dysmetria 

CAG 

25-42; 47-63 


Exon 

Dentatorubral- 
pallidoluysian atrophy 
(Haw River syndrome) 

Cerebellar atrophy, 
ataxia, myoclonic 
epilepsy, 
choreoathetosis, 
dementia 

CAG 

7-34; 49-88 

More often through 
father 

Exon 

Huntington disease-like 

2 

Features very similar 
to those of 
Huntington disease 

CTG 

7-28; 66-78 


Exon 

Category 2 

Pseudoachondroplasia, 
multiple epiphyseal 
dysplasia 

Short stature, joint 
laxity, degenerative 
joint disease 

GAC 

5; 6-7 


Exon 

Oculopharyngeal 
muscular dystrophy 

Proximal limb 
weakness, 
dysphagia, ptosis 

GCG 

6; 7-13 


Exon 

Cleidocranial dysplasia 

Short stature, open 
skull sutures with 
bulging calvaria, 
clavicular 
hypoplasia, 
shortened fingers, 
dental anomalies 

GCG, GCT, 
GCA 

17; 27 (expansion 
observed in one 
family) 


Exon 

Synpolydactyly 

Polydactyly and 
syndactyly 

GCG, GCT, 
GCA 

15; 22-25 


Exon 
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TABLE 5-3 Diseases Associated with Repeat Expansions—cont'd 





NORMAL 

PARENT IN WHOM 




REPEAT 

RANGE; DISEASE 

EXPANSION 

LOCATION OF 

DISEASE 

DESCRIPTION 

SEQUENCE 

RANGE 

USUALLY OCCURS 

EXPANSION 

Category 3 

Myotonic dystrophy 

Muscle loss, cardiac 

CTG 

5-37; 50 to several 

Either parent, but 

3' untranslated 

(DM1; chromosome 

arrhythmia, 


thousand 

expansion to 


19) 

cataracts, frontal 



congenital region 



balding 



form through 
mother 


Myotonic dystrophy 

Muscle loss, cardiac 

CCTG 

10-26; 75-11,000 

— 

3' untranslated 

(DM2; chromosome 3) 

arrhythmia, 
cataracts, frontal 
balding 




region 

Friedreich ataxia 

Progressive limb 

GAA 

6-32; 200-1700 

Disorder is 

Intron 


ataxia, dysarthria. 



autosomal 



hypertrophic 



recessive, so 



cardiomyopathy. 



disease alleles are 



pyramidal 



inherited from 



weakness in legs 



both parents 


Fragile X syndrome 

Intellectual disability. 

CGG 

4-39; 200-900 

Exclusively through 

5' untranslated 

(FRAXA) 

large ears 
and jaws, 
macroorchidism in 
males 



mother 

region 

Fragile site (FRAXE) 

Mild intellectual 

GCC 

6-35; >200 

More often through 

5' untranslated 


disability 



mother 

region 

Spinocerebellar ataxia 

Adult-onset ataxia, 

CTG 

16-34; >74 

More often through 

3' untranslated 

type 8 

dysarthria, 

nystagmus 



mother 

region 

Spinocerebellar ataxia 

Ataxia and seizures 

ATTCT 

10-20; 500-4500 

More often through 

Intron 

type 10 




father 


Spinocerebellar ataxia 

Ataxia, eye 

CAG 

7-45; 55-78 

— 

5' untranslated 

type 12 

movement 

disorders; variable 
age at onset 




region 

Progressive myoclonic 

Juvenile-onset 

12-bp repeat 

2-3; 30-75 

Autosomal recessive 

5' untranslated 

epilepsy type 1 

convulsions. 

motif 


inheritance, so 

region 


myoclonus. 



transmitted by 



dementia 



both parents 



myotonic dystrophy) the mutation produces a harmful RNA 
product. Repeat expansions are often larger when they are 
transmitted through the mother. Anticipation is seen in most 
of the diseases in the first and third categories. 

Anticipation refers to progressively earlier or 
more severe expression of a disease in more 
recent generations. Expansion of DNA repeats 
has been shown to cause anticipation in 
some genetic diseases. These diseases can be 
divided into three major categories, depending 
on the size of the expansion, the location of 
the repeat, the phenotypic consequences 
of the expansion, the effect of the mutation, 
and the parent in whom large expansions typi¬ 
cally occur. 


The Fragile X Story: Molecular Genetics Explains 
a Puzzling Pattern of Inheritance 

Since the 19th century, it has been observed that there is an 
approximate 25% excess of males among persons with intel¬ 
lectual disability. This excess is partly explained by X-linked 
conditions that cause cognitive impairment, of which the 
fragile X syndrome is the most common. In addition to intel¬ 
lectual disability, fragile X syndrome is characterized by a dis¬ 
tinctive facial appearance, with large ears and long face (Fig. 
5-20), hypermobile joints, and macroorchidism (increased 
testicular volume) in postpubertal males. The degree of intel¬ 
lectual disability tends to be milder and more variable in 
females than in males. The syndrome is termed “fragile X” 
because the X chromosomes of affected persons, when cul¬ 
tured in a medium that is deficient in folic acid, sometimes 
exhibit breaks and gaps near the tip of the long arm (Fig. 5-21). 
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C 

FIG 5-20 Boys with fragile X syndrome. Note the long faces, prominent jaws, and large ears 
and the similar characteristics of children from different ethnic groups: European (A), Asian (B), 
and Latin American (C). 





FIG 5-21 An X chromosome from a male with fragile X syn¬ 
drome, showing an elongated, condensed region near the tip 
of the long arm. (From Stein CK. Applications of cytogenetics 
in modern pathology. In: McPherson RA, Pincus MR, eds. 
Henrys Clinical Diagnosis and Management by Laboratory 
Methods. 21st ed. Philadelphia: Saunders; 2006.) 


Although the presence of a single fragile X mutation is 
sufficient to cause disease in either males or females, the 
prevalence of this condition is higher in males (1/4000) than 
in females (1/8000). The lower degree of penetrance in 
females, as well as variability in expression, reflects variation 
in patterns of X inactivation (i.e., the percentage of active X 
chromosomes that carry the disease-causing mutation). 
Males who have affected descendants but are not affected 
themselves are termed normal transmitting males. In the mid- 
1980s, studies of fragile X syndrome pedigrees revealed a 
perplexing pattern: the mothers of transmitting males had a 
much lower percentage of affected sons than did the daugh¬ 
ters of these males (Fig. 5-22). Because the mothers and 
daughters of normal transmitting males are both obligate 
carriers of the X-linked mutation, they should have equal 
risks of producing affected sons. Daughters of normal 


IV 


V 



Affected 

individuals 

have 

>200 

repeats 


FIG 5-22 A pedigree showing the inheritance of the fragile X 
syndrome. Females who carry a premutation (50 to 200 CGG 
repeats) are shown by dotted symbols. Affected individuals 
are represented by solid symbols. A normal transmitting 
male, who carries a premutation of 70 to 90 repeat units, is 
designated NTM. Note that the number of repeats increases 
each time the mutation is passed through another female. 
Also, only 5% of the NTM's sisters are affected, and only 9% 
of his brothers are affected, but 40% of his grandsons and 
16% of his granddaughters are affected. This is the Sherman 
paradox. 
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transmitting males were never affected with the disorder, but 
these women’s sons could be affected. This pattern, dubbed 
the Sherman paradox, appeared to be inconsistent with the 
rules of X-linked inheritance. 

Many mechanisms were proposed to explain this pattern, 
including autosomal and mitochondrial modifier loci. Reso¬ 
lution of the Sherman paradox came only with the identifica¬ 
tion of the disease-causing gene, labeled FMRl. DNA sequence 
analysis showed that the 5' untranslated region of the gene 
contains a CGG repeat unit that is present in 6 to 50 copies 
in unaffected persons. Those with fragile X syndrome have 
200 to 1000 or more CGG repeats (a full mutation). An inter¬ 
mediate number of repeats, ranging approximately from 50 
to 200 copies, is seen in normal transmitting males and their 
female offspring and is termed a premutation. When these 
female offspring transmit the gene to their offspring, there is 
sometimes an expansion from the premutation of 50 to 200 
repeats to the full mutation of more than 200 repeats. These 
expansions do not occur in male transmission. Furthermore, 
premutations tend to become larger in successive generations, 
and larger premutations are more likely to expand to a full 
mutation. These findings explain the Sherman paradox. 
Males with the premutation do not have daughters with 
fragile X syndrome because expansion to the full mutation 
occurs only in female transmission. Grandsons and great- 
grandsons of transmitting males are more likely to be affected 
by the disorder than are brothers of transmitting males 
because of progressive repeat expansion through successive 
generations of female permutation carriers. 

Measurement of mRNA transcribed from FMRl has 
shown that the highest mRNA expression levels are in the 
brain, as would be expected. Persons with normal and pre¬ 
mutation FMRl genes both produce mRNA. In fact, mRNA 


production is elevated in those with premutations, and it has 
been shown that this mRNA accumulates in the nucleus and 
has toxic effects, as do mutated mRNAs in myotonic dystro¬ 
phy. Consequently, about more than half of males with pre¬ 
mutations develop a neurological disease characterized by 
ataxia and tremors by age 70 (the risk for female carriers is 
less than 20%). Approximately 25% of females with FMRl 
premutations experience premature ovarian failure (amenor¬ 
rhea before age 40 years), again because of toxic mRNA 
effects. In contrast, those with full mutations have no FMRl 
mRNA in their cells, indicating no transcription of the gene. 
The CGG repeat is heavily methylated in those with the full 
mutation, as is a series of CG sequences 5' of the gene. The 
degree of methylation, which is likely to influence transcrip¬ 
tion of FMRl, is correlated with severity of expression of 
the disorder. A small percentage of persons with fragile X 
syndrome (<5%) do not have an expansion of the CGG 
repeat but instead have other loss-of-function mutations 
in FMRL 

The protein product of FMRl, FMRP, binds to RNA and 
is capable of shuttling back and forth between the nucleus 
and the cytoplasm. It appears that FMRP is involved in trans¬ 
porting mRNA from the nucleus to the cytoplasm, and it is 
thought to be involved in regulating the translation of mRNA. 

Identification of the FMRl gene and the nature of its 
repeat expansion have already explained a great deal about 
the inheritance and expression of the fragile X syndrome. 
These advances have also improved diagnostic accuracy for 
the condition, because cytogenetic analysis of chromosomes 
often fails to identify fragile X heterozygotes. In contrast, 
DNA diagnosis, which consists of measurement of the length 
of the CGG repeat sequence and the degree of methylation 
of FMRl, is much more accurate. 


I STUDY QUESTIONS 


1. Females have been observed with five X chromosomes in 
each somatic cell. How many Barr bodies would such 
females have? 

2. Explain why 8% to 10% of female carriers of the DMD 
gene have muscle weakness. 

3. For X-linked recessive disorders, the ratio of affected 
males to affected females in populations increases as the 
disease frequency decreases. Explain this in terms of gene 
and genotype frequencies. 

4. Eigure 5-23 shows the inheritance of hemophilia A in a 
family. What is the risk that the male in generation IV is 
affected with hemophilia A? What is the risk that the 
female in generation IV is a heterozygous carrier? What 
is the risk that she is affected by the disorder? 

5. Pedigrees for autosomal dominant and X-linked domi¬ 
nant diseases are sometimes difficult to distinguish. 
Name as many features as you can that would help to tell 
them apart. 

6. How would you distinguish mitochondrial inheritance 
from other modes of inheritance? 


7. A man with Becker muscular dystrophy marries a phe- 
notypically normal woman. On average, what percentage 
of their male offspring will be affected and what percent¬ 
age of their female offspring will be affected by this 
disorder? 



FIG 5-23 Pedigree for study question 4. 
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8. A female carrier of a Duchenne muscular dystrophy 
mutation marries a phenotypically normal man. On 
average, what percentage of their male offspring will be 
affected and what percentage of their female offspring 
will be affected with the disorder? 

9. A boy and his brother both have hemophilia A. If there 
is no family history of hemophilia A in previous genera¬ 
tions, what is the probability that the boys’ aunt (the 


mother’s sister) is a heterozygous carrier of the disease 
gene? 

10. It is possible to create a zygote from two copies of the 
maternal genome alone. In amphibians, the zygote will 
develop and mature into an adult without fertilization by 
a sperm cell (this process is known as parthenogenesis). 
The same experiment has been attempted in mice, but it 
always results in early prenatal death. Explain this. 
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6 


Clinical Cytogenetics: The Chromosomal 

Basis of Human Disease 


The previous two chapters have dealt with single-gene dis¬ 
eases. We turn now to diseases caused by alterations in the 
number or structure of chromosomes. The study of chromo¬ 
somes and their abnormalities is called cytogenetics. 

Chromosome abnormalities are responsible for a signifi¬ 
cant fraction of genetic diseases, occurring in approximately 
1 of every 150 live births. They are the leading known cause 
of intellectual disability and pregnancy loss. Chromosome 
abnormalities are seen in 50% of first-trimester and 20% of 
second-trimester spontaneous abortions. Thus, they are an 
important cause of morbidity and mortality. 

As in other areas of medical genetics, advances in molecu¬ 
lar genetics have contributed many new insights in the field 
of cytogenetics. For example, molecular techniques have per¬ 
mitted the identification of chromosome abnormalities, such 
as deletions, that affect very small regions. In many cases, 
specific genes that contribute to the phenotypes of cytoge¬ 
netic syndromes are being pinpointed. In addition, the ability 
to identify DNA polymorphisms in parents and offspring has 
enabled researchers to specify whether an altered chromo¬ 
some is derived from the mother or from the father. This has 
increased our understanding of the biological basis of meiotic 
errors and chromosome abnormalities. 

In this chapter we discuss abnormalities of chromosome 
number and structure. The genetic basis of sex determination 
is reviewed, the role of chromosome alterations in cancer is 
examined, and several diseases caused by chromosomal insta¬ 
bility are discussed. Emphasis is given to the recent contribu¬ 
tions of molecular genetics to cytogenetics. 

CYTOGENETIC TECHNOLOGY AND 
NOMENCLATURE 

Although it was possible to visualize chromosomes under 
microscopes as early as the mid-1800s, it was quite difficult 
to observe individual chromosomes and thus to count the 
number of chromosomes in a cell or to examine structural 
abnormalities. Beginning in the 1950s, several techniques 
were developed that improved our ability to observe chromo¬ 
somes. These included the use of spindle poisons, such as 
colchicine and colcemid, that arrest dividing somatic cells in 
metaphase, when chromosomes are maximally condensed 
and easiest to see; the use of a hypotonic (low-salt) solution, 
which causes swelling of cells, rupture of the nucleus, and 


better separation of individual chromosomes; and the use of 
staining materials that are absorbed differently by different 
parts of chromosomes, producing the characteristic light and 
dark bands that help to identify individual chromosomes. 

Our ability to study chromosomes has been 
improved by the visualization of chromosomes 
in metaphase, by hypotonic solutions that 
cause nuclear swelling, and by staining tech¬ 
niques that label chromosome bands. 

Chromosomes are typically analyzed by collecting a living 
tissue (usually blood), culturing the tissue for the appropriate 
amount of time (usually 48 to 72 hours for peripheral lym¬ 
phocytes), adding colcemid to produce metaphase arrest, 
harvesting the cells, placing the cell sediment on a slide, rup¬ 
turing the cell nucleus with a hypotonic saline solution, stain¬ 
ing with a designated nuclear stain, and photographing the 
metaphase spreads of chromosomes on the slide. The images 
of the 22 pairs of autosomes are arranged according to length, 
with the sex chromosomes in the right-hand corner. This 
ordered display of chromosomes is termed a karyogram or 
karyotype (Fig. 6-1). (The term karyotype refers to the 
number and type of chromosomes present in an individual, 
and karyogram is now often used to designate the printed 
display of chromosomes.) Currently, computerized image 
analyzers are usually used to display chromosomes. 

After sorting by size, chromosomes are further classified 
according to the position of the centromere. If the centro¬ 
mere occurs near the middle of the chromosome, the chro¬ 
mosome is said to be metacentric (Fig. 6-2). An acrocentric 
chromosome has its centromere near the tip, and submeta- 
centric chromosomes have centromeres somewhere between 
the middle and the tip. The tip of each chromosome is the 
telomere. The short arm of a chromosome is labeled p (for 
“petite”), and the long arm is labeled q. In metacentric chro¬ 
mosomes, where the arms are of roughly equal length, the p 
and q arms are designated by convention. 

A karyotype, or karyogram, is a display of 
chromosomes ordered according to length. 
Depending on the position of the centromere, 
a chromosome may be acrocentric, submeta- 
centric, or metacentric. 
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FIG 6-1 A banded karyogram (karyotype) of a normal male. The banded metaphase chromo¬ 
somes are arranged from largest to smallest. 
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FIG 6-2 Metacentric, submetacentric, and acrocentric chro¬ 
mosomes. Note the stalks and satellites present on the short 
arms of the acrocentric chromosomes. 


A normal female karyotype is designated 46,XX; a normal 
male karyotype is designated 46,XY. The nomenclature for 
various chromosome abnormalities is summarized in Table 
6-1 and is indicated for each condition discussed in this 
chapter. 

Chromosome Banding 

Early karyotypes were useful in counting the number of chro¬ 
mosomes, but structural abnormalities, such as balanced 
rearrangements or small chromosomal deletions, were often 
undetectable. Staining techniques were developed in the 
1970s to produce the chromosome bands characteristic of 
modern karyotypes. Chromosome banding helps greatly in 
the detection of deletions, duplications, and other structural 


abnormalities, and it facilitates the correct identification of 
individual chromosomes. The major bands on each chromo¬ 
some are systematically numbered (Fig. 6-3). For example, 
14q32 refers to the second band in the third region of the 
long arm of chromosome 14. Sub-bands are designated by 
decimal points following the band number (e.g., 14q32.3 is 
the third sub-band of band 2). 

Several chromosome-banding techniques are used in cyto¬ 
genetics laboratories. Quinacrine banding (Q-banding) 
was the first staining method used to produce specific 
banding patterns. This method requires a fluorescence 
microscope and is no longer as widely used as Giemsa 
banding (G-banding). To produce G-bands, a Giemsa 
stain is applied after the chromosomal proteins are partially 
digested by trypsin. Reverse banding (R-banding) requires 
heat treatment and reverses the usual white and black 
pattern that is seen in G-bands and Q-bands. This method is 
particularly helpful for staining the distal ends of chromo¬ 
somes. Other staining techniques include C-banding and 
nucleolar organizing region stains (NOR stains). These latter 
methods specifically stain certain portions of the chromo¬ 
some. C-banding stains the constitutive heterochromatin, 
which usually lies near the centromere, and NOR staining 
highlights the satellites and stalks of acrocentric chromo¬ 
somes (see Fig. 6-2). 

High-resolution banding involves staining chromosomes 
during prophase or early metaphase (prometaphase), before 
they reach maximal condensation. Because prophase and 
prometaphase chromosomes are more extended than 
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TABLE 6-1 Standard Nomenclature for Chromosome Karyotypes 


KARYOTYPE 

46, XY 

47, XX,+21 

47,XY,+21(10]/46,XY[101 

46,XY,del(4)(p14) 

46,XX,dup(5)(p14p15.3) 

45,XY,der(13;14)(q10:q10) 


46,XY,t(11:22)(q23:q22) 

46,XX,inv(3)(p21q13) 

46,X,r(X)(p22.3q28) 

46,X,i(Xq) 


DESCRIPTION 

Normal male chromosome constitution 
Female with trisomy 21, Down syndrome 

Male who is a mosaic of trisomy 21 cells and normal cells (10 cells scored for each karyotype) 

Male with distal and terminal deletion of the short arm of chromosome 4 from band pi 4 to terminus 
Female with a duplication within the short arm of chromosome 5 from bands pi 4 to pi 5.3 
A male with a balanced Robertsonian translocation of chromosomes 13 and 14; karyotype shows 
that one normal 13 and one normal 14 are missing and replaced with a derivative chromosome 
composed of the long arms of chromosomes 13 and 14 
A male with a balanced reciprocal translocation between chromosomes 11 and 22; the breakpoints 
are at 11q23 and 22q22 

An inversion on chromosome 3 that extends from p21 to q13; because it includes the centromere, 
this is a pericentric inversion 

A female with one normal X chromosome and one ring X chromosome formed by breakage at 
bands p22.3 and q28 with subsequent fusion 

A female with one normal X chromosome and an isochromosome of the long arm of the X 
chromosome 


metaphase chromosomes, the number of bands observable 
for all chromosomes increases from about 300 to 450 (as in 
Fig. 6-3) to as many as 800. This allows the detection of less 
obvious abnormalities usually not seen with conventional 
banding. 

Chromosome bands help to identify individual 
chromosomes and structural abnormalities 
in chromosomes. Banding techniques include 
quinacrine, Giemsa, reverse, C, and NOR 
banding. High-resolution banding, using 
prophase or prometaphase chromosomes, 
increases the number of observable bands. 

Fluorescence in Situ Hybridization 

In the widely used fluorescence in situ hybridization (FISH) 

technique, a labeled single-stranded DNA segment (probe) is 
exposed to denatured metaphase, prophase, or interphase 
chromosomes from a patient. The probe undergoes comple¬ 
mentary base pairing (hybridization) only with the comple¬ 
mentary DNA sequence at a specific location on one of the 
denatured chromosomes. Because the probe is labeled with a 
fluorescent dye, the location at which it hybridizes with the 
patient’s chromosomes can be visualized under a fluorescence 
microscope. A common use of FISH is to determine whether 
a portion of a chromosome is deleted in a patient. In an unaf¬ 
fected person, a probe hybridizes in two places, reflecting the 
presence of two homologous chromosomes in a somatic cell 
nucleus. If a probe from the chromosome segment in ques¬ 
tion hybridizes to only one of the patient’s chromosomes, 
then the patient probably has the deletion on the copy of the 
chromosome to which the probe fails to hybridize. FISH pro¬ 
vides considerably better resolution than high-resolution 
banding approaches and can typically detect deletions as 
small as 1 million base pairs (1 Mb) in size. It is widely used 


to detect common deletion syndromes such as Prader-Willi 
syndrome (microdeletion of 15qll.2) and Williams syn¬ 
drome (microdeletion of 7qll.2) (discussed later). 

Extra copies of a chromosome region can also be detected 
using FISH. In this case, the probe hybridizes in three or more 
places instead of two. Combinations of FISH probes can also 
be used to detect chromosome rearrangements such as trans¬ 
locations (see later discussion). 

Figure 6-4A illustrates a FISH result for a child who is 
missing a small piece of the short arm of chromosome 17. 
Although a centromere probe (used as a control) hybridizes 
to both copies of chromosome 17, a probe corresponding to 
a specific region of 17p hybridizes to only one chromosome. 
This demonstrates the deletion that causes the Smith-Magenis 
syndrome (see Fig. 6-4B; also see Table 6-3). 

Because FISH detection of missing or extra chromo¬ 
somes can be carried out with interphase chromosomes, 
it is not necessary to stimulate cells to divide in order to 
obtain metaphase chromosomes (a time-consuming proce¬ 
dure required for traditional microscopic approaches). This 
allows analyses and diagnoses to be completed more rapidly. 
FISH analysis of interphase chromosomes is commonly used 
in the prenatal detection of fetal chromosome abnormalities 
and in the analysis of chromosome rearrangements in tumor 
cells. 

The FISH technique has been extended by using multiple 
probes, each labeled with a different color, so that several of 
the most common numerical abnormalities (e.g., those of 
chromosomes 13, 18, 21, X, and Y) can be tested simultane¬ 
ously in the same cell. In addition, techniques such as spectral 
karyotyping use varying combinations of five different fluo¬ 
rescent probes in conjunction with special cameras and 
image-processing software so that each chromosome is 
uniquely colored (painted along its entire length with a series 
of same-colored probes) for ready identification. Such images 
can be especially useful for identifying small chromosome 
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1 Negative or pale staining ‘Q’ and ‘G’ bands 
i Positive ‘R’ bands 

1 Positive ‘Q’ and ‘G’ bands 
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FIG 6-3 A schematic representation of the banding pattern of a G-banded karyotype; 300 
bands are represented in this ideogram. The short and long arms of the chromosomes are des¬ 
ignated, and the segments are numbered according to the standard nomenclature adopted at 
the Paris conference in 1971. In this illustration, both sister chromatids are shown for each 
chromosome. 


X 



FIG 6-4 A, A fluorescence in situ hybridization (FISH) result. The thinner arrows point to a 
centromere-hybridizing probe for chromosome 17, and the fh/c/fer arrow points to a probe that 
hybridizes to 17p. The latter probe reveals only one spot in this individual, who has a deletion 
of 17p, producing the Smith-Magenis syndrome. (Courtesy Dr. Arthur Brothman, University 
of Utah Health Sciences Center.) B, Face of an infant girl with Smith-Magenis syndrome. Note 
the broad forehead and relatively flat face. (Courtesy Dr. Marilyn C. Jones, Children's Hospital, 
San Diego.) 
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FIG 6-5 Spectral karyotype. An application of spectral karyotyping is demonstrated by the iden¬ 
tification of a rearrangement between chromosomes 2 and 22. Note that a portion of chromo¬ 
some 2 (purple) has exchanged places with a portion of chromosome 22 (yellow). (Courtesy Dr. 
Arthur Brothman, University of Utah Health Sciences Center.) 


rearrangements (Fig. 6-5). With the increasing use of tech¬ 
niques such as comparative genomic hybridization (see 
below), spectral karyotyping is now less used in clinical 
practice. 

FISH is a technique in which a labeled probe 
is hybridized to metaphase, prophase, or inter¬ 
phase chromosomes. FISH can be used to test 
for missing or additional chromosomal mate¬ 
rial as well as chromosome rearrangements. 
The FISH technique can be extended with mul¬ 
tiple colors to detect several possible altera¬ 
tions of chromosome number simultaneously. 
Multiple probes can be used to paint each 
chromosome with a unique color, facilitating 
the detection of structural rearrangements. 

Comparative Genomic Hybridization 

Losses or duplications of whole chromosomes or specific 
chromosome regions can be detected by a widely used tech¬ 
nique known as comparative genomic hybridization (CGH) 
(Fig. 6-6). DNA is extracted from a test source, such as cells 
from a tumor or whole-blood cells from a patient. The DNA 
is then labeled with a substance that exhibits one color (e.g., 
red) under a fluorescence microscope; DNA taken from 
normal control cells is labeled with a second color (e.g., 
green). In the earliest version of CGH, both sets of DNA are 
then hybridized to normal metaphase chromosomes on a 
slide. If any chromosome region is duplicated in the tumor 
cell, then the corresponding region of the metaphase chro¬ 
mosome will hybridize with excess quantities of the 


red-labeled DNA. This region will appear red under the 
microscope. Conversely, if a region is deleted in the tumor 
cell, then the corresponding region of the metaphase chro¬ 
mosome will hybridize only with green-labeled control DNA, 
and the region will appear green under the microscope. CGH 
is especially useful in scanning for deletions and duplications 
of chromosome material in cancer cells, where detection of 
such alterations can help to predict the type and/or severity 
of the cancer. 

A serious limitation of CGH when used with metaphase 
chromosomes is that deletions or duplications smaller than 
5 to 10 Mb cannot be detected microscopically. Higher reso¬ 
lution is offered by array CGH (aCGH), in which test and 
control DNA is hybridized with a microarray (see Chapter 3) 
that contains hundreds of thousands to millions of oligo¬ 
nucleotide probes (see Chapter 3) whose DNA sequences 
correspond to specific regions of the genome. These microar¬ 
rays provide resolution to 50 to 100 kb or even less, allowing 
the detection of duplications and deletions that may affect 
only a single gene. 

Array CGH offers a number of other advantages over tra¬ 
ditional karyotype analysis. The process is highly automated, 
requiring less time from laboratory personnel. There is no 
need for dividing cells (in contrast to the analysis of meta¬ 
phase chromosomes), and less than a microgram of DNA is 
sufficient for analysis of the entire genome. For these reasons, 
aCGH has become one of the most commonly used tech¬ 
niques in cytogenetics laboratories. The primary disadvan¬ 
tage of aCGH is that it cannot detect balanced rearrangements 
of chromosomes (i.e., reciprocal translocations or inversions) 
because the quantity of chromosome material remains 
unchanged. 
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FIG 6-6 The comparative genomic hybridization (CGH) technique. A, Red-labeled test DNA (from 
a tumor sample in this case) and green-labeled reference DNA (from normal cells) are both 
denatured and hybridized to normal chromosomes. The ratio of green to red signal on the hybrid¬ 
ized chromosomes indicates the location of duplications (red signal) or deletions (green signal) 
in the tumor chromosomes. B, Array CGH: Test and normal DNA are hybridized to probes embed¬ 
ded in a microarray. Duplications are indicated by hybridization of more red-labeled DNA to a 
probe containing a DNA sequence that is complementary to the duplicated region. Conversely, 
hybridization of only green-labeled DNA (reference DNA) indicates a deletion of the correspond¬ 
ing region. C, In a patient with DiGeorge sequence, an aCGH test was performed in which the 
patient's DNA was labeled green and the control DNA was labeled red. The figure demonstrates 
a lack of green signal and excess red signal, signifying a deletion of chromosome 22q11. 


The CGH technique, in which differentially 
labeled DNA from test and control sources is 
hybridized to probes in microarrays, allows the 
detection of chromosome duplications and 
deletions but not balanced rearrangements. 
Array CGH can detect deletions and duplica¬ 
tions shorter than 100 kb and requires only 
small amounts of DNA. 


ABNORMALITIES OF 

CHROMOSOME NUMBER _ 

Polyploidy 

A cell that contains a multiple of 23 chromosomes in its 
nucleus is said to be euploid (Greek, eu = “good,” ploid = 
“set”). Thus, haploid gametes and diploid somatic cells are 
euploid. Polyploidy, the presence of a complete set of extra 
chromosomes in a cell, is seen commonly in plants and often 
improves their agricultural value. Polyploidy also occurs in 
humans, although much less frequently. The polyploid 


conditions that have been observed in humans are triploidy 
(69 chromosomes in the nucleus of each cell) and tetraploidy 
(92 chromosomes in each cell nucleus). The karyotypes for 
these two conditions are designated 69,XXX and 92,XXXX, 
respectively (assuming that all of the sex chromosomes were 
X; other combinations of the X and Y chromosomes may be 
seen). Because the number of chromosomes present in each 
of these conditions is a multiple of 23, the cells are euploid 
in each case. However, the additional chromosomes encode a 
large amount of surplus gene product, causing multiple 
anomalies such as defects of the heart and central nervous 
system. 

Triploidy is seen in about 1 in 10,000 live births, but it 
accounts for an estimated 15% of the chromosome abnor¬ 
malities occurring at conception. Thus, the vast majority of 
triploid conceptions are spontaneously aborted, and this con¬ 
dition is one of the most common causes of fetal loss in the 
first two trimesters of pregnancy. Triploid fetuses that survive 
to term typically die shortly after birth. The most common 
cause of triploidy is the fertilization of an egg by two sperm 
(dispermy). The resulting zygote receives 23 chromosomes 
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from the egg and 23 chromosomes from each of the two 
sperm cells. Triploidy can also be caused by the fusion of an 
ovum and a polar body, each containing 23 chromosomes, 
and subsequent fertilization by a sperm cell. Meiotic failure, 
in which a diploid sperm or egg cell is produced, can also 
produce a triploid zygote. 

Tetraploidy is much rarer than triploidy, both at concep¬ 
tion and among live births. It has been recorded in only a few 
live births, and those infants survived for only a short period. 
Tetraploidy can be caused by a mitotic failure in the early 
embryo: all of the duplicated chromosomes migrate to one 
of the two daughter cells. It can also result from the fusion of 
two diploid zygotes. 

Cells that have a multiple of 23 chromosomes 
are said to be euploid. Triploidy (69 chromo¬ 
somes) and tetraploidy (92 chromosomes) are 
polyploid conditions found in humans. Most 
polyploid conceptions are spontaneously 
aborted, and all are incompatible with long¬ 
term survival. 

Autosomal Aneuploidy 

Cells that contain missing or additional individual chromo¬ 
somes are termed aneuploid (not a multiple of 23 chromo¬ 
somes). Usually only one chromosome is affected, but it is 


possible for more than one chromosome to be missing or 
duplicated. Aneuploidies of the autosomes are among the 
most clinically important of the chromosome abnormalities. 
They consist primarily of monosomy (the presence of only 
one copy of a chromosome in an otherwise diploid cell) and 
trisomy (three copies of a chromosome). Autosomal mono¬ 
somies are nearly always incompatible with survival to term, 
so only a small number of them have been observed among 
live-born individuals. In contrast, some trisomies are seen 
with appreciable frequencies among live births. The fact that 
trisomies produce less-severe consequences than monoso¬ 
mies illustrates an important principle: the body can tolerate 
excess genetic material more readily than it can tolerate a deficit 
of genetic material. 

The most common cause of aneuploidy is nondisjunc¬ 
tion, the failure of chromosomes to disjoin normally during 
meiosis (Fig. 6-7). Nondisjunction can occur during meiosis 
I or meiosis II (see Chapter 2). The resulting gamete either 
lacks a chromosome or has two copies of it, producing a 
monosomic or trisomic zygote, respectively. 

Aneuploid conditions consist primarily of 
monosomies and trisomies. They are usually 
caused by nondisjunction. Autosomal monoso¬ 
mies are almost always lethal, but some auto¬ 
somal trisomies are compatible with survival. 




Parent 


Meiosis I 


Meiosis II 


Gametes 


Fertilization 
with normal 
gamete 


Offspring 



Trisomy Monosomy Monosomy Trisomy 

FIG 6-7 In meiotic nondisjunction, two chromosome homologs migrate to the same daughter 
cell instead of disjoining normally and migrating to different daughter cells. This produces mono¬ 
somic and trisomic offspring. 
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Trisomy 21 

Trisomy 21 (karyotype 47,XY,+21 or 47,XX,+21)’^ is seen in 
approximately 1 of every 700 to 1000 live births, making it the 
most common autosomal aneuploid condition compatible 
with survival to term. This trisomy is the most common cause 
of Down syndrome, a phenotype originally described by John 
Langdon Down in 1866. Almost 100 years elapsed between 
Down’s description of this syndrome and the discovery (in 
1959) that it is caused by the presence of an extra copy of chro¬ 
mosome 21. 

Although there is considerable variation in the appearance 
of persons with Down syndrome, they present with a constella¬ 
tion of features that help the clinician to make a clinical diag¬ 
nosis. The facial features include a depressed nasal root, 
upslanting palpebral fissures, measurably small and sometimes 
overfolded ears, and a flattened maxillary and malar region, 
giving the face a characteristic appearance (Fig. 6-8). Some of 
these features led to the use of the term “mongolism” in earlier 
literature, but this term is inappropriate and is no longer used. 
The cheeks are round, and the corners of the mouth are some¬ 
times downturned. The neck is short, and the skin is redundant 
at the nape of the neck, especially in newborns. The occiput is 
flat, and the hands and feet tend to be rather broad and short. 
Approximately 50% of persons with Down syndrome have a 
single flexion crease across their palms (single palmar crease, 
formerly termed a “simian crease,” though this term is also now 
considered inappropriate). Decreased muscle tone (hypotonia) 
is a highly consistent feature that is also helpful in making a 
diagnosis. None of these features are diagnostic per se as all are 
observed in the general population of infants; it is the clustering 
together that suggests the diagnosis. 

Several medically significant problems occur with increased 
frequency among infants and children with Down syndrome. 
About 3% of these infants develop an obstruction of the duo¬ 
denum or atresia (closure or absence) of the esophagus, duo¬ 
denum, or anus. Respiratory infections are quite common, and 
the risk of developing leukemia is 15 to 20 times higher in 
patients with Down syndrome than in the general population. 
The most significant medical problem is that approximately 
40% of these persons are born with structural heart defects. 
The most common of these is an atrioventricular (AV) canal, 
a defect in which the interatrial and interventricular septa fail 
to fuse normally during fetal development. The result is blood 
flow from the left side of the heart to the right side and then to 
the pulmonary vasculature, producing pulmonary hyperten¬ 
sion. Ventricular septal defects (VSDs) are also common. 

Moderate to mild intellectual disability’ (standardized IQ 
ranging from 40 to 60) occurs in most persons with Down 


*For brevity, the remainder of the karyotype designations for abnor¬ 
malities not involving the sex chromosomes will indicate an affected 
male. 

^Because of the potentially stigmatizing aspects of the term “mental 
retardation,” many professional organizations and advocacy groups 
related to Down syndrome are currently abandoning the term and 
using intellectual or cognitive disability in its place. 


syndrome, and this condition alone accounts for approxi¬ 
mately 10% of all cases of intellectual disability in the United 
States. In recent years clinical trials designed to test whether 
specific medications might improve attention and learning in 
school age-children with Down syndrome have been initiated 
(see www.clinicaltrials.gov). 

Several other medical problems occur in infants and young 
children with Down syndrome. Conductive and sometimes 
sensorineural hearing loss, hypothyroidism, and various eye 
abnormalities are the most important and common. Clinical 
Commentary 6-1 outlines a plan for routine medical care of 
infants and children with Down syndrome. 

The medical problems seen in children with Down syn¬ 
drome result in decreased survival rates. Congenital heart 
defects are the most important single cause of early mortality. 
In the early 1960s, only about half of children with this dis¬ 
order survived as long as 5 years. As a result of improvements 
in corrective surgery, antibiotic treatment, and management 
of leukemia, the survival rate has increased considerably in 
the past 40 years. Currently, it is estimated that more than 
80% of children with Down syndrome will survive to age 10 
years, and more than half survive to 50 years of age. There is 
convincing evidence that enriched environments and educa¬ 
tional interventions can produce significant improvements in 
intellectual function. 

Males with Down syndrome are nearly always sterile, with 
only a few reported cases of reproduction. Many females with 
Down syndrome can reproduce, although approximately 
40% fail to ovulate. A female with Down syndrome has a 50% 
risk of producing a gamete with two copies of chromosome 
21 (which would then produce a trisomic zygote). However, 
because about 75% of trisomy 21 conceptions are spontane¬ 
ously aborted, the risk of producing affected live-born off¬ 
spring is considerably lower than 50% in women with Down 
syndrome. Because reproduction is uncommon, nearly all 
cases of trisomy 21 can be regarded as new mutations. 

Approximately 95% of Down syndrome cases are caused 
by nondisjunction, and most of the remainder are caused by 
chromosome translocations (see later discussion). Compari¬ 
sons of chromosome 21 polymorphisms in parents and off¬ 
spring have demonstrated that the extra chromosome is 
contributed by the mother in 90% to 95% of trisomy 21 cases. 
About 75% of these maternal nondisjunctions occur during 
meiosis I, and the remainder occur during meiosis II. As 
discussed in greater detail later, there is a strong correlation 
between maternal age and the risk of producing a child with 
Down syndrome. 

Mosaicism (see Chapter 4) is seen in approximately 2% to 
4% of trisomy 21 live births. These persons have some normal 
somatic cells and some cells with trisomy 21. This type of 
mosaicism in a male is designated 47,XY,-l-21[10]/46,XY[10], 
with the numbers inside the brackets indicating the number of 
cells scored with each karyotype. The most common cause of 
mosaicism in trisomy is a trisomic conception followed by loss 
of the extra chromosome during mitosis in some embryonic 
cells. Mosaicism often results in milder clinical expression of 
the phenotype associated with a chromosome abnormality. 
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FIG 6-8 A, An infant with Down syndrome, illustrating typical features of this disorder: upslanting 
palpebral fissures, redundant skin of the inner eyelid (epicanthic fold), protruding tongue, and 
low nasal bridge. B, Same girl as In A, 7 years later. Note that the typical features are present 
but less obviously expressed. C, A karyogram of a male with trisomy 21. 
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CLINICAL COMMENTARY 6-1 

Anticipatory Guidance and Health Supervision in Children with Down Syndrome 


An approach termed health supervision and anticipatory guid¬ 
ance has evolved for the care and treatment of persons with 
genetic syndromes and chronic diseases. After a thorough 
study of the disease in question (including an extensive litera¬ 
ture review), basic guidelines are established for the screening, 
evaluation, and management of patients. If followed by the 
primary care practitioner or the specialist, these guidelines 
should help to prevent further disability or illness. We illustrate 
the health supervision and anticipatory guidance approach 
with the current guidelines for care of children with Down 
syndrome. 

• As mentioned in the text, AV canals are the most common 
congenital heart defect seen in newborns with Down syn¬ 
drome. Surgical correction of this condition is appropriate if 
it is detected before 1 year of age; after this time, pulmonary 
hypertension has been present too long for surgery to be 
successful. Accordingly, it is now recommended that an 
echocardiogram be performed during the newborn period 
and no later than 6 months. 

• Because Down syndrome patients often have strabismus 
(deviation of the eye from its normal visual axis) and other 
eye problems, they should be examined regularly by their 
physician. If any symptoms or signs are observed, the 
patient is referred to an ophthalmologist familiar with Down 


syndrome. In asymptomatic children, an ophthalmological 
examination before the age of 4 years is recommended to 
evaluate visual acuity. 

• Hypothyroidism is common, especially during adolescence. 
Therefore, thyroid hormone levels should be measured 
annually. 

• Sensorineural and conductive hearing loss are both seen In 
children with Down syndrome. The routine follow-up should 
Include a hearing test at birth and every 6 months until 2 
years of age, with subsequent testing as needed. 

• Instability of the first and second vertebrae has led to spinal 
cord injuries in some older Down syndrome patients. It is 
thus suggested that imaging studies be carried out in chil¬ 
dren with neurological symptoms and in those planning to 
participate in athletic activities. 

• Referral of infants and children with Down syndrome 
to preschool programs to provide intervention for develop¬ 
mental disabilities Is an important component of routine 
care. 

Similar series of guidelines have been developed for children 
with trisomy 18, Williams syndrome, and Turner syndrome. In 
principle, the anticipatory guidance and health supervision 
approach can be applied to any genetic disease for which there 
is sufficient knowledge. 


Depending on the timing and the way the mosaicism 
originated, some persons have tissue-specific mosaicism. As 
the term suggests, this type of mosaicism is confined only to 
certain tissues. This can complicate diagnosis because cyto¬ 
genetic analysis is usually based on a single tissue type (usually 
circulating lymphocytes derived from a blood sample, or, less 
commonly, fibroblasts derived from a skin biopsy). Mosa¬ 
icism affecting primarily the germline of a parent can lead to 
multiple recurrences of Down syndrome in the offspring. 
This factor helps to account for the fact that the recurrence 
risk for Down syndrome among mothers younger than 30 
years is about 1% (i.e., 10 times higher than the population 
risk for this age group). 

Because of the prevalence and clinical importance of 
Down syndrome, considerable effort has been devoted to 
determining which genes on chromosome 21 account for the 
features of this disorder. A candidate for intellectual disability 
in Down syndrome is DYRKIA, a kinase gene that causes 
learning and memory defects when it is over-expressed in 
mice. Another gene located in the critical region, APP, encodes 
the amyloid (3 precursor protein. A third copy of APP is likely 
to account for the occurrence of Alzheimer disease features 
in nearly all Down syndrome patients by 40 years of age. 
Mutations of APP cause a small percentage of Alzheimer 
disease cases (see Chapter 12), and Down syndrome indi¬ 
viduals with partial trisomies that do not include the APP 
gene do not develop Alzheimer disease features. 

Trisomy 21, which causes Down syndrome, is 
the most common autosomal aneuploidy seen 


among live births. The most significant fea¬ 
tures include intellectual disability, gastroin¬ 
testinal tract obstruction, congenital heart 
defects, and respiratory infections. The extra 
21st chromosome is contributed by the mother 
in approximately 90% of cases. Mosaicism is 
seen in 2% to 4% of Down syndrome cases, 
and it often accompanies a milder phenotype. 
Specific genes contributing to the Down syn¬ 
drome phenotype are being identified. 

Trisomy 18 

Trisomy 18 (47,XY,-l-18), also known as Edwards syndrome, 
is the second most common autosomal trisomy, with a preva¬ 
lence of about 1 per 6000 live births. It is, however, much 
more common at conception and is the most common chro¬ 
mosome abnormality among stillborns with congenital mal¬ 
formations. It is estimated that fewer than 5% of trisomy 18 
conceptions survive to term. 

The Edwards syndrome phenotype is as discernible as 
Down syndrome, but because it is less common, it is less likely 
to be recognized clinically. Infants with trisomy 18 have pre¬ 
natal growth deficiency (weight that is low for gestational 
age), characteristic facial features, and a distinctive hand 
abnormality that often helps the clinician to make the initial 
diagnosis (Fig. 6-9). Minor anomalies of diagnostic impor¬ 
tance include small ears with unraveled helices, a small mouth 
that is often hard to open, a short sternum, and short halluces 
(first toes). Most babies with trisomy 18 have major malfor¬ 
mations. Congenital heart defects, particularly VSDs, are the 
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FIG 6-9 A girl with full trisomy 18 at 3 years (A) and 13 years (B) of age. She shows the typical 
facial features of an older child with short palpebral fissures and ear variations. A also depicts 
the overriding of the index finger onto the third finger, a characteristic finding of the hands in 
the syndrome. 


most common and occur in 90% of children. Other medically 
significant congenital malformations include omphalocele 
(protrusion of the bowel into the umbilical cord), radial 
aplasia (missing radius bone), diaphragmatic hernia, and, 
occasionally, spina bifida. 

About 50% of infants with trisomy 18 die within the first 
several weeks of life, and only about 5% to 8% survive to 12 
months of age. A combination of factors, including aspiration 
pneumonia, predisposition to infections and apnea, and con¬ 
genital heart defects, accounts for the high mortality rate. 

Marked developmental disabilities are seen among those 
trisomy 18 patients who survive infancy. The degree of delay 
is much more significant than in Down syndrome, and most 
children are not able to walk independently. However, chil¬ 
dren with trisomy 18 do progress in their milestones, albeit 
slowly, and older children learn some communication skills. 

More than 95% of patients with Edwards syndrome have 
complete trisomy 18; only a small percentage have mosaicism. 
As in trisomy 21, there is a significant maternal age effect, and 
more than 90% of trisomy 18 cases are the result of an extra 
chromosome transmitted by the mother. 

Trisomy 13 

Trisomy 13 (47,XY,-t-13), also termed Patau syndrome, is seen 
in about 1 of every 10,000 births. The malformation pattern 
is quite distinctive and usually permits clinical recognition. It 
consists primarily of orofacial clefts, microphthalmia (small, 
abnormally formed eyes), and postaxial polydactyly (Fig. 
6-10). Malformations of the central nervous system are often 
seen, as are heart defects and renal abnormalities. Cutis 



FIG 6-10 An 8-year-old girl with full trisomy 13 showing her 
small eyes and prominent, wide nose. 
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aplasia (a defect of the skin of the scalp on the posterior 
occiput) can also occur. 

The survival rate is very similar to that of trisomy 18, and 
about 95% of live-born infants die during the first year of life. 
Children who survive infancy have significant developmental 
disability, with skills seldom progressing beyond those of a 
child of 2 years. However, as in trisomy 18, children with 
trisomy 13 do progress somewhat in their development and 
are able to communicate with their families to some degree. 

About 80% of patients with Patau syndrome have full 
trisomy 13. Most of the remaining patients have trisomy of 
the long arm of chromosome 13 due to a translocation (see 
later discussion). As in trisomies 18 and 21, the risk of bearing 
a child with this condition increases with advanced maternal 
age. It is estimated that 95% or more of trisomy 13 concep¬ 
tions are spontaneously lost during pregnancy. 

Trisomies of the 13th and 18th chromosomes 
are sometimes compatible with survival to 
term, although 95% or more of affected fetuses 
are spontaneously aborted. These trisomies 
are much less common at birth than is trisomy 
21, and they produce more serious disease 
features, with 90% to 95% mortality during the 
first year of life. As in trisomy 21, there is a 
maternal age effect, and the mother contrib¬ 
utes the extra chromosome in more than 90% 
of cases. 

Trisomies, Nondisjunction, and Maternal Age 

The prevalence of Down syndrome among offspring of 
mothers of different ages is shown in Figure 6-11. Among 
mothers younger than 30 years of age, the risk is less than 
1/1000. It increases to approximately 1/400 at age 35 years. 


NUMBER OF DOWN SYNDROME 
CASES PER 1000 BIRTHS 



FIG 6-11 The prevalence of Down syndrome among live 
births in relation to age of the mother. The prevalence 
Increases with maternal age and becomes especially notable 
after the age of 35 years. (Data from Hook EB, Chambers 
GM. Birth defects. 1977;23[3A1:123-141.) 


1/100 at age 40 years, and approximately 1/25 after age 45 
years. Most other trisomies, including those in which the 
fetus does not survive to term, also increase in prevalence as 
maternal age increases. This risk is one of the primary indica¬ 
tions for prenatal diagnosis (see Chapter 13). 

Several hypotheses have been advanced to account for this 
increase, including the idea that a trisomic pregnancy is less 
likely to be spontaneously aborted in older women. Direct 
studies of the frequency of chromosome abnormalities in 
sperm and egg cells indicate that the pattern is instead due to 
an increase in nondisjunction among older mothers. Recall 
that nearly all of a female’s oocytes are formed during her 
embryonic development. (There is recent evidence that a 
small number of oocytes may be produced later in life.) They 
remain suspended in prophase I until they are shed during 
ovulation. Thus, an ovum produced by a 45-year-old woman 
is itself about 45 years old. This long period of suspension in 
prophase I might impair normal chromosome disjunction, 
although the precise nature of this mechanism is not 
understood. 

Many factors have been examined to determine whether 
they affect the frequency of nondisjunction in women. These 
include hormone levels, cigarette smoking, autoimmune 
thyroid disease, alcohol consumption, and radiation (the 
latter does increase nondisjunction when administered in 
very large doses to experimental animals). None of these 
factors has shown consistent correlations with nondisjunc¬ 
tion in humans, however, and maternal age remains the only 
known correlated factor. 

Although maternal age is strongly correlated with the risk 
of Down syndrome, approximately three fourths of Down 
syndrome children are born to mothers younger than 35 
years of age. This is because the great majority of children 
(more than 90%) are borne by women in this age group. 

Numerous studies, including direct analysis of sperm cells, 
have tested the hypothesis of a paternal age effect for triso¬ 
mies. The consensus is that such an effect, if it exists at all, 
is minor. This is probably because spermatocytes, unlike 
oocytes, are generated throughout the life of the male. 

Nearly all autosomal trisomies increase with 
maternal age as a result of nondisjunction in 
older mothers. There is little evidence for a 
paternal age effect on nondisjunction in males. 

Sex Chromosome Aneuploidy 

Among live-born infants, about 1 in 400 males and 1 in 650 
females have some form of sex chromosome aneuploidy. Pri¬ 
marily because of X inactivation, the consequences of this 
class of aneuploidy are less severe than those of autosomal 
aneuploidy. With the exception of the complete absence of an 
X chromosome, all of the sex chromosome aneuploidies are 
compatible with survival in at least some cases. 

Monosomy of the X Chromosome (Turner Syndrome) 

The phenotype associated with a single X chromosome (45, 
X) was described by Henry Turner in 1938. (An earlier 
description was given by Otto Ullrich in 1930.) Persons with 
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Turner syndrome are female and usually have a characteristic 
phenotype, including the variable presence of proportionate 
short stature, sexual infantilism and ovarian dysgenesis, and 
a pattern of major and minor malformations. The physical 
features can include a triangle-shaped face; posteriorly 
rotated external ears; and a broad, “webbed neck” (Fig. 6-12). 
In addition, the chest is broad and shield-like in shape. 
Lymphedema of the hands and feet is observable at birth. 
Many infants with Turner syndrome have congenital heart 
defects, most commonly obstructive lesions of the left side of 
the heart (bicuspid aortic valve in 50% of patients and coarc¬ 
tation [narrowing] of the aorta in 15-30%). Severe obstruc¬ 
tions should be surgically repaired. About 50% of persons 
with Turner syndrome have structural kidney defects, but 
they usually do not cause medical problems. There is typi¬ 
cally some diminution in spatial perceptual ability, but intel¬ 
ligence is usually normal. 

Girls with Turner syndrome exhibit proportionate short 
stature and do not undergo an adolescent growth spurt. 
Mature height is reduced by approximately 20 cm, on average. 
Growth hormone administration produces increased height 
somewhat in these girls, and families now commonly choose 
this therapy. In most persons with Turner syndrome, streaks 
of connective tissue, rather than ovaries, are seen (gonadal 
dysgenesis). Lacking normal ovaries, they do not usually 
develop secondary sexual characteristics, and most women 
with this condition are infertile (about 5-10% have sufficient 
ovarian development to undergo menarche, and a small 



FIG 6-12 A girl with Turner syndrome (45,X). Note the char¬ 
acteristically broad, webbed neck. Stature is reduced, and 
swelling (lymphedema) is seen in the ankles and wrists. 


number have borne children). Teenagers with Turner syn¬ 
drome are typically treated with estrogen to promote the 
development of secondary sexual characteristics. The dose is 
then continued at a reduced level to maintain these charac¬ 
teristics and to help prevent osteoporosis. 

The diagnosis of Turner syndrome is often made in the 
newborn infant, especially if there is a noticeable webbing of 
the neck coupled with a heart defect. The facial features are 
more subtle than in the autosomal abnormalities described 
previously, but the experienced clinician can often diagnose 
Turner syndrome on the basis of one or more of the listed 
clues. If Turner syndrome is not recognized in infancy or 
childhood, it is often diagnosed later because of short stature 
and/or amenorrhea. 

The chromosome abnormalities in persons with Turner 
syndrome are quite variable. About 50% of these patients 
have a 45,X karyotype in their peripheral lymphocytes. 
At least 30% to 40% have mosaicism, most commonly 
45,X/46,XX and less commonly 45,X/46,XY. Patients with 
mosaicism who have Y chromosomes in some cells are pre¬ 
disposed to neoplasms (gonadoblastomas) in the gonadal 
streak tissue. About 10% to 20% of patients with Turner 
syndrome have structural X chromosome abnormalities 
involving a deletion of some or all of Xp. This variation in 
chromosome abnormality helps to explain the considerable 
phenotypic variation seen in this syndrome. 

Approximately 60% to 80% of monosomy X cases are 
caused by the absence of a paternally derived sex chromo¬ 
some, occurring either during early mitosis in the embryo or 
during meiosis in the father (i.e., the offspring receives an X 
chromosome only from the mother). The 45,X karyotype is 
estimated to occur in 1% to 2% of conceptions, but Turner 
syndrome is seen in only about 1/2000 to 1/3000 live-born 
girls. Thus, the great majority (more than 99%) of 45,X con¬ 
ceptions are lost prenatally. Among those that do survive to 
term, many have mosaicism, and mosaicism of the placenta 
alone (confined placental mosaicism) is especially common. 
It is likely that the presence of some normal cells in mosaic 
fetuses enhances fetal survival. 

Molecular analysis has begun to pinpoint specific genes 
involved in the Turner syndrome phenotype. For example, 
mutations in the SHOX gene, which encodes a transcription 
factor expressed in embryonic limbs, produce short stature. 
This gene is located on the distal tip of the X and Y short 
arms (in a region of the X chromosome that escapes inactiva¬ 
tion; see Clinical Commentary 6-2). Thus, it is normally tran¬ 
scribed in two copies in both males and females. In females 
with Turner syndrome, this gene is present in only one active 
copy, and the resulting haploinsufficiency contributes to 
short stature. 

Most females with Turner syndrome have a 
45,X karyotype. Although this disorder is 
common at conception, it is relatively rare 
among live births, reflecting a very high rate 
of spontaneous abortion. Mosaicism, includ¬ 
ing confined placental mosaicism, appears to 
increase the probability of survival to term. 
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CLINICAL COMMENTARY 6-2 

XX Males, XY Females, and the Genetic Basis of Sex Determination 

During normal meiosis in the male, crossover occurs between 
the tip of the short arm of the Y chromosome and the tip of 
the short arm of the X chromosome (Fig. 6-13). These regions 
of the X and Y chromosomes contain highly similar DNA 
sequences. Because this resembles the behavior of auto- 
somes during meiosis, the distal portion of the Y chromosome 
is known as the pseudoautosomal region. It spans approxi¬ 
mately 2.5 Mb. 

Just centromeric of the pseudoautosomal region lies a gene 
known as SRY (sex-determining region on the Y). This gene, 
which is expressed in embryonic development, encodes a tran¬ 
scription factor that interacts with other genes to initiate the 
development of the undifferentiated embryo into a male 
(including Sertoli cell differentiation and secretion of mullerian- 
inhibiting substance). In particular, the protein product of SRY 
binds to an enhancer element that regulates expression of the 
SOX9 gene, which in turn regulates a series of genes that 
promote male development while repressing ovarian develop¬ 
ment. SRY thus acts as a key regulatory switch in sex deter¬ 
mination. When the mouse Srygene is inserted experimentally 
into a female mouse embryo, a male mouse is produced. Loss- 
of-function mutations of SRY can produce individuals with an 
XY karyotype but a female phenotype. SOX9 acts as the next 
switch in this pathway, and SOX9 mutations can produce sex 
reversal (XY females) and campomelic dysplasia (malforma¬ 
tions of bone and cartilage). 

Approximately 1 of every 20,000 males presents with a phe¬ 
notype similar to Klinefelter syndrome (without increased 
height), but chromosome evaluation shows that they have a 
normal female karyotype (46,XX). It has been demonstrated 
that these XX males have an X chromosome that includes the 
SRY gene. This is a result of a faulty crossover between the X 
and Y chromosomes during male meiosis, such that the SRY 
gene, instead of remaining on the Y chromosome, is trans¬ 
ferred to the X chromosome. The offspring who inherits this X 
chromosome from his father consequently has a male pheno¬ 
type. Conversely, an offspring who inherits a Y chromosome 
that lacks the SRY gene is an XY female. These females have 
gonadal streaks rather than ovaries and have poorly developed 
secondary sexual characteristics. 
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FIG 6-13 The distal short arms of the X and Y chromo¬ 
somes exchange material during meiosis in the male. The 
region of the Y chromosome in which this crossover occurs 
is called the pseudoautosomal region. The SRY gene, which 
triggers the process leading to male gonadal differentiation, 
is located just outside the pseudoautosomal region. Occa¬ 
sionally, the crossover occurs on the centromeric side of 
the SRY gene, causing it to lie on an X chromosome instead 
of on the Y chromosome. An offspring receiving this X 
chromosome will be an XX male, and an offspring receiving 
the Y chromosome will be an XY female. 


Klinefelter Syndrome 

Like Down and Turner syndromes, the syndrome associated 
with a 47,XXY karyotype was identified before the underlying 
chromosomal abnormality was understood. Described in 
1942 by Harry Klinefelter, the syndrome that bears his name 
is seen in approximately 1/500 to 1/1000 male births. Although 
Klinefelter syndrome is a common cause of primary hypogo¬ 
nadism in males, the phenotype is less striking than those of 
the syndromes described thus far. Patients with Klinefelter 
syndrome tend to be taller than average, with disproportion¬ 
ately long arms and legs (Fig. 6-14). Clinical examination of 
postpubertal patients reveals small testes (less than 10 mL in 
volume), and most males with Klinefelter syndrome are 
sterile as a result of atrophy of the seminiferous tubules. Tes¬ 
tosterone levels in adolescents and adults are low. Gyneco¬ 
mastia (breast development) is seen in approximately one 


third of affected persons and leads to an increased risk of 
breast cancer, which can be reduced by mastectomy (breast 
removal). Body hair is typically sparse after puberty, and 
muscle mass tends to be reduced. In addition, there is a pre¬ 
disposition for learning disabilities and a reduction in verbal 
IQ. Although intelligence is usually in the normal range, the 
IQ is on average 10 to 15 points lower than that of the affected 
persons siblings. Because of the subtlety of this disorder, 
Klinefelter syndrome is often not diagnosed until after 
puberty, and the condition is sometimes first ascertained in 
fertility clinics. 

The extra X chromosome is derived maternally in about 
50% of Klinefelter cases, and the syndrome increases in inci¬ 
dence with advanced maternal age. Mosaicism, which is seen 
in about 15% of patients, increases the likelihood of viable 
sperm production. Individuals with the 48,XXXY and 
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FIG 6-14 A male with Klinefelter syndrome (47,XXY). Stature 
is increased, gynecomastia may be present, and body shape 
may be somewhat feminine. 


49,XXXXY karyotypes have also been reported. Because they 
have a Y chromosome, they have a male phenotype, but the 
degree of developmental disability and physical abnormality 
increases with each additional X chromosome. 

Testosterone therapy, beginning in midadolescence, can 
enhance secondary sex characteristics, and it helps to decrease 
the risk of osteoporosis. There is some evidence that this 
treatment also improves psychological well-being. 



Males with Klinefelter syndrome (47,XXY) are 
taller than average, might have reduced IQ, 
and are usually sterile. Testosterone therapy 
and mastectomy for gynecomastia are some¬ 
times recommended. 


Trisomy X 

The 47,XXX karyotype occurs in approximately 1/1000 
females and usually has benign consequences. Overt physical 
abnormalities are rarely seen, but these females sometimes 
have sterility, menstrual irregularity, or mild cognitive dis¬ 
ability. As in Klinefelter syndrome, the 47,XXX karyotype is 
often first ascertained in fertility clinics. Approximately 90% 
of cases are the result of nondisjunction in the mother, and, 
as with other trisomies, the incidence increases among the 
offspring of older mothers. 


Females have also been seen with four, five, or even more 
X chromosomes. Each additional X chromosome is accom¬ 
panied by increased intellectual disability and physical 
abnormality. 


47,XYY Syndrome 

The final sex chromosome aneuploidy to be discussed is the 
47,XYY karyotype. Males with this karyotype tend to be taller 
than average, and they have a 10- to 15-point reduction in 
average IQ. This condition, which causes few physical prob¬ 
lems, achieved great notoriety when its incidence in the male 
prison population was discovered to be as high as 1/30, com¬ 
pared with 1/1000 in the general male population. This led 
to the suggestion that this karyotype might confer a predis¬ 
position to violent, criminal behavior. A number of studies 
have addressed this issue and have shown that XYY males are 
not inclined to commit violent crimes. There is, however, 
evidence of an increased incidence of minor behavioral dis¬ 
orders, such as hyperactivity, attention deficit disorder, and 
learning disabilities. 



The 47,XXX and 47,XYY karyotypes are seen 
in about 1/1000 females and males, respec¬ 
tively. Each involves a slight degree of reduc¬ 
tion in IQ but few physical problems. 


CHROMOSOME ABNORMALITIES AND 
PREGNANCY LOSS 

For a long time it was difficult to detect the early stages of 
pregnancy with certainty. It was thus possible for a woman 
to become pregnant but to lose the embryo before knowing 
of the pregnancy. Sensitive urinary assays of chorionic gonad¬ 
otropin levels, which increase when the embryo implants 
in the uterine wall, have allowed researchers to pinpoint 
accurately the occurrence of pregnancy at this early stage. 
Follow-up of women in whom implantation was verified in 
this way has shown that about one third of pregnancies are 
lost after implantation (the number lost before implantation 
is unknown). Therefore, spontaneous pregnancy loss is 
common in humans. 

As mentioned earlier, chromosome abnormalities are the 
leading known cause of pregnancy loss. It is estimated that a 
minimum of 10% to 20% of conceptions have a chromosome 
abnormality, and at least 95% of these chromosomally abnor¬ 
mal conceptions are lost before term. Karyotype studies of 
miscarriages indicate that about 50% of the chromosome 
abnormalities are trisomies, 20% are monosomies, and 15% 
are triploids. The remainder consists of tetraploids and struc¬ 
tural abnormalities. Some chromosome abnormalities that 
are common at conception seldom or never survive to term. 
For example, trisomy 16 is thought to be the most common 
trisomy at conception, but it is never seen in live births. 

It is possible to study chromosome abnormalities directly 
in sperm and egg cells. Oocytes are typically obtained from 
unused material in in vitro fertilization studies. Karyotypes 
obtained from these cells indicate that at least 20% to 25% of 
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oocytes have missing or extra chromosomes. Human sperm 
cells can be studied by FISH analysis or after fusing them with 
hamster oocytes so that their DNA begins mitosis and con¬ 
denses, allowing easier visualization. The frequency of aneu- 
ploidy in these sperm cells is about 3% to 4%. Structural 
abnormalities (see later discussion) are seen in about 1% of 
oocytes and 5% of sperm cells, and the incidence increases 
with advanced paternal age. Undoubtedly, this high rate of 
chromosome abnormality contributes importantly to later 
pregnancy loss. 

These approaches, while informative, can involve some 
biases. For example, mothers in whom in vitro fertilization is 
performed are not a representative sample of the population. 
In addition, their oocytes have been stimulated artificially, 
and only those oocytes that could not be fertilized by sperm 
cells are studied. Thus, the oocytes themselves might not be 
a representative sample. The sperm cells studied in human- 
hamster hybrids represent only those that are capable of pen¬ 
etrating the hamster oocyte and again might not be a 
representative sample. 

FISH analysis of aneuploidy can evaluate many cells quite 
rapidly, giving it an important advantage over the human- 
hamster technique. In general, the FISH studies have yielded 
results similar to those of the human-hamster studies, 
showing that on average, the frequency of disomy is approxi¬ 
mately 0.15% for each autosome and 0.26% for the sex chro¬ 
mosomes. These studies have also confirmed a tendency for 
elevated frequencies of nondisjunction of the sex chromo¬ 
somes and some of the acrocentric chromosomes, including 
chromosome 21, in sperm cells. 

Pregnancy loss is common in humans, with 
approximately one third of pregnancies lost 
spontaneously after implantation. Chromo¬ 
some abnormalities, which have been studied 
in sperm and egg cells and in miscarriages and 
stillbirths, are an important cause of preg¬ 
nancy loss. 

ABNORMALITIES OF 
CHROMOSOME STRUCTURE 

In addition to the loss or gain of whole chromosomes, parts 
of chromosomes can be lost or duplicated as gametes are 
formed, and the arrangement of portions of chromosomes 
can be altered. Structural chromosome abnormalities may be 
unbalanced (the rearrangement causes a gain or loss of chro¬ 
mosomal material) or balanced (the rearrangement does not 
produce a loss or gain of chromosome material). Unlike 
aneuploidy and polyploidy, balanced structural abnormali¬ 
ties often do not produce serious health consequences. Nev¬ 
ertheless, abnormalities of chromosome structure, especially 
those that are unbalanced, can produce serious disease in 
individuals or their offspring. 

Alterations of chromosome structure can occur when 
homologous chromosomes line up improperly during meiosis 


(e.g., unequal crossover, as described in Chapter 5). In addi¬ 
tion, chromosome breakage can occur during meiosis or 
mitosis. Mechanisms exist to repair these breaks, and usually 
the break is repaired perfectly with no damage to the daugh¬ 
ter cell. Sometimes, however, the breaks remain, or they heal 
in a fashion that alters the structure of the chromosome. The 
likelihood of chromosome breakage may be increased in the 
presence of certain harmful agents, called clastogens. Clasto- 
gens identified in experimental systems include ionizing 
radiation, some viral infections, and some chemicals, but 
none have been convincingly demonstrated to cause chromo¬ 
some abnormalities in humans. 

Translocations 

A translocation is the interchange of genetic material between 
nonhomologous chromosomes. Balanced translocations rep¬ 
resent one of the most common chromosomal aberrations in 
humans, occurring in 1 of every 500 to 1000 individuals 
(Table 6-2). There are two basic types of translocations, 
reciprocal and Robertsonian. 

Reciprocal Translocations 

Reciprocal translocations happen when breaks occur in 
two different chromosomes and the material is mutually 
exchanged. The resulting chromosomes are called derivative 
chromosomes. The carrier of a reciprocal translocation is 
usually unaffected because he or she has a normal comple¬ 
ment of genetic material. However, the carrier’s offspring can 
be normal, can carry the translocation, or can have duplica¬ 
tions and deletions of genetic material. 


TABLE 6-2 Prevalence of Chromosomal 
Abnormalities among Newborns 

1 ABNORMALITY 

PREVALENCE AT BIRTH | 

Autosomal Syndromes 

Trisomy 21 

1/700 

Trisomy 18 

1/6000 

Trisomy 13 

1/10,000 

Unbalanced rearrangements 

1/17,000 

Balanced rearrangements 
Robertsonian translocations 

1/1000 

Reciprocal translocations 

1/11,000 

Sex Chromosome Abnormalities 

47,XXY 

1/1000 male births 

47,XYY 

1/1000 male births 

45,X* 

1/5000 female births 

47,XXX 

1/1000 female births 

All Chromosome Abnormalities 

Autosomal disorders and 

1/230 

unbalanced rearrangements 
Balanced rearrangements 

1/500* 


*The 45,X karyotype accounts for about half of the cases of Turner 
syndrome. 
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An example of a reciprocal translocation between chro¬ 
mosomes 3 and 6 is shown in Figure 6-15. The distal half of 
the short arm of chromosome 6 is translocated to the short 
arm of chromosome 3, and a small piece of chromosome 3 
is translocated to the short arm of chromosome 6. If the 
translocations occur at 3pl3 and 6pl4, the karyotype is des¬ 
ignated 46,XX, t(3;6)(pl3;pl4). The offspring of this woman 
received the derivative chromosome 3, termed der(3), and the 
normal 6; thus, the child had a partial trisomy of the distal 


portion of chromosome 6 (i.e., 6p trisomy). This is a well- 
established but rather uncommon chromosomal syndrome. 

Reciprocal translocations are caused by two 
breaks on different chromosomes, with a sub¬ 
sequent exchange of material. Although carri¬ 
ers of balanced reciprocal translocations 
usually have normal phenotypes, their off¬ 
spring might have a partial trisomy or a partial 
monosomy and an abnormal phenotype. 




Normal 3 




Normal 6 


Normal 6 


B Offspring 

FIG 6-15 A, The parent has a reciprocal balanced transloca¬ 
tion involving the short arms of chromosomes 6 and 3. The 
distal short arm of the 6 has been translocated to the very 
distal tip of the 3. A small piece of chromosome 3 is attached 
to the derivative 6. This person had a child whose chromo¬ 
somes are depicted in B. The child received the derivative 
chromosome 3 (with part of the 6 short arm attached) and 
the normal 6; from the other parent, the child received a 
normal 3 and a normal 6. Therefore, the child had a partial 
trisomy of the 6 short arm and, presumably, a small deletion 
of the 3 short arm. 



Robertsonian Translocations 

In Robertsonian translocations, the short arms of two non- 
homologous chromosomes are lost and the long arms fuse at 
the centromere to form a single chromosome (Fig. 6-16). This 
type of translocation is confined to the acrocentric chromo¬ 
somes (13, 14, 15, 21, and 22) and has no effect on a carrier 
because the short arms of these chromosomes are very small 
and contain no essential genetic material. Although Robert¬ 
sonian translocation carriers have only 45 chromosomes in 
each cell, they are phenotypically unaffected. Their offspring, 
however, may inherit an extra or missing long arm of an 
acrocentric chromosome. 


Robertsonian translocation 



□ 



Chromosome 13 t(13;14) Chromosome 14 



FIG 6-16 In a Robertsonian translocation, shown here, the 
long arms of two acrocentric chromosomes (13 and 14) fuse, 
forming a single chromosome. 
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A common Robertsonian translocation involves fusion of 
the long arms of chromosomes 14 and 21. The karyotype of 
a male carrier of this translocation is 45,XY,der(14;21)(qlO; 
qlO). This person lacks one normal 14 and one normal 21 
and instead has a chromosome derived from a translocation 
of the entire long arms of chromosomes 14 and 21. During 
meiosis in this person, the translocation chromosome must 
still pair with its homologs. Figure 6-17 illustrates the ways 
these chromosomes can segregate in the gametes formed by 
the translocation carrier. If alternate segregation occurs, 
then the offspring are either chromosomally normal or have 
a balanced translocation with a normal phenotype. If one of 
the adjacent segregation patterns occurs, then the gametes 
are unbalanced and the offspring may have trisomy 14, 
monosomy 14, monosomy 21, or trisomy21. (Note that these 
trisomies and monosomies are genetically the same as triso- 
mies and monosomies produced by nondisjunction, because 
only the long arms of these chromosomes contain genetically 
significant material.) Fetuses with the first three adjacent seg¬ 
regation patterns do not survive to term, and the last pattern 
results in an infant with three copies of the long arm of chro¬ 
mosome 21 and a Down syndrome phenotype. Robertsonian 
translocations are responsible for approximately 5% of Down 
syndrome cases. 

It is expected that the three types of conceptions compat¬ 
ible with survival would occur in equal frequencies: One third 
would be completely normal, one third would carry the 
translocation but be phenotypically normal, and one third 
would have Down syndrome. In part because of prenatal loss, 
the actual fraction of live-born offspring with Down syn¬ 
drome is less than one third (about 10-15% for mothers who 
carry the translocation, and only 1-2% for fathers who carry 
it). This recurrence risk, however, is greater than the risk for 
parents of a child who has the nondisjunction type of Down 
syndrome (1% for mothers younger than 30 years of age). 
This difference in recurrence risks demonstrates why it is 
critical to order a chromosome study whenever a condition 
such as Down syndrome is suspected. 

Robertsonian translocations occur when the 
long arms of two acrocentric chromosomes 
fuse at the centromere. The carrier of a Rob¬ 
ertsonian translocation can produce concep¬ 
tions with monosomy or trisomy of the long 
arms of acrocentric chromosomes. 

Deletions 

A deletion is caused by a chromosome break and subsequent 
loss of genetic material. A single break leading to a loss that 
includes the chromosome’s tip is called a terminal deletion. 
An interstitial deletion results when two breaks occur and the 
material between the breaks is lost. For example, a chromo¬ 
some segment with normal DNA can be symbolized 
ABCDEFG. An interstitial deletion could produce the sequence 
ABEFG, and a terminal deletion could produce ABCDE. 

Usually, a gamete containing a chromosome with a dele¬ 
tion unites with a normal gamete to form a zygote. The zygote 


then has one normal chromosome and a homolog with the 
deletion. Microscopically visible deletions generally involve 
multiple genes, and the consequences of losing this much 
genetic material from even one member of the chromosome 
pair can be severe. After the three autosomal aneuploidies 
described earlier, the autosomal deletion syndromes are the 
most common group of clinically significant chromosome 
abnormalities. 

A well-known example of a chromosome deletion syn¬ 
drome is the cri-du-chat syndrome. This term (French, “cry 
of the cat”) describes the distinctive cry of the child. This cry 
usually becomes less obvious as the child ages, making a clini¬ 
cal diagnosis more difficult after 2 years of age. Cri-du-chat 
syndrome is caused by a deletion of the distal short arm of 
chromosome 5; the karyotype is 46,XY,del(5p). Seen in 
approximately 1 in 50,000 live births, it is also characterized 
by intellectual disability (average IQ about 35), microcephaly 
(small head), and a characteristic but not distinctive facial 
appearance. Although mortality rates are increased if serious 
congenital defects are present, most persons with cri-du-chat 
syndrome now survive to adulthood. 

Wolf-Hirschhorn syndrome (Fig. 6-18), caused by a dele¬ 
tion of the distal short arm of chromosome 4, is another 
well-characterized deletion syndrome. Other well-known 
deletions include those of 18p, 18q, and 13q. With the excep¬ 
tion of the 18p deletion syndrome, each of these disorders is 
relatively distinctive, and the diagnosis can often be made 
before the karyotype is obtained. The features of the 18p 
deletion syndrome are more subtle, and usually this condi¬ 
tion is recognized when chromosome analysis is performed 
for evaluation of developmental disability. 

Microscopically observable chromosome dele¬ 
tions, which may be either terminal or intersti¬ 
tial, usually affect a fairly large number of 
genes and produce recognizable syndromes. 

Microdeletion Syndromes 

The deletions described thus far all involve relatively large 
segments of chromosomes, and many of them were described 
before chromosome banding techniques were developed. 
With the advent of high-resolution banding, it has become 
possible to identify microscopically a large number of dele¬ 
tions that were previously too small for detection. In addi¬ 
tion, advances in molecular genetics, particularly the FISH 
and aCGH techniques (see Fig. 6-4), have permitted the 
detection of deletions that are often too small to be observed 
microscopically (i.e., <5 Mb). 

Prader-Willi syndrome, a disorder discussed in Chapter 5, 
is a good example of a microdeletion syndrome. Although 
this condition was described in the 1950s,* it was not until 


^Although Prader is commonly credited with the first complete 
description of Prader-Willi syndrome in 1956, John Langdon Down 
(of Down syndrome fame) published a fairly complete description 
of the disorder in 1887. 
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FIG 6-17 The possible segregation patterns for the gametes formed by a carrier of a Robertso¬ 
nian translocation. Alternate segregation (quadrant a alone, or quadrant b with quadrant c) pro¬ 
duces either a normal chromosome constitution or a translocation carrier with a normal phenotype. 
Adjacent segregation (quadrant a with b, quadrant c alone, quadrant a with c, or quadrant b alone) 
produces unbalanced gametes and results in conceptions with translocation Down syndrome, 
monosomy 21, trisomy 14, or monosomy 14, respectively. For example, monosomy 14 is pro¬ 
duced when the parent who carries the translocation transmits a copy of chromosome 21 but 
does not transmit a copy of chromosome 14 (as in the lower right corner). 
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1981 that advanced banding techniques detected a small dele¬ 
tion of chromosome bands 15qll-ql3 in about 50% of these 
patients. With the use of molecular techniques, deletions that 
were too small to be detected cytogenetically were also dis¬ 
covered. In total, about 70% of Prader-Willi cases are caused 
by microdeletions of 15q. 



FIG 6-18 Child with Wolf-Hirschhorn syndrome (46,XX, 
dei[4p]). Note the wide-spaced eyes and repaired cleft lip. 


Because of imprinting, the inheritance of a microdeletion 
of the paternal chromosome 15 material produces Prader- 
Willi syndrome, while a microdeletion of the maternally 
derived chromosome 15 produces the phenotypically distinct 
Angelman syndrome (see Chapter 5). 

Williams syndrome, which is characterized by mild intel¬ 
lectual disability, supravalvular aortic stenosis (SVAS), mul¬ 
tiple peripheral pulmonary arterial stenoses, characteristic 
facial features, dental anomalies, and hypercalcemia, is 
another example of a microdeletion syndrome (Fig. 6-19). A 
series of molecular analyses have identified some of the indi¬ 
vidual genes responsible for the Williams syndrome pheno¬ 
type. The gene that encodes elastin, ELN, for example, is 
located in the Williams syndrome critical region and is 
expressed in blood vessels. Elastin is an important compo¬ 
nent of the aortic wall (microfibrils, which were discussed in 
Chapter 4 in the context of Marfan syndrome, are another 
component). Mutations or deletions of elastin alone result 
in isolated SVAS without the other features of Williams 
syndrome. Larger deletions, encompassing additional genes, 
produce the complete Williams syndrome phenotype. A 
second gene in the critical region, LIMKl, encodes a brain- 
expressed kinase that may be involved in the visual-spatial 
cognition defects observed in patients with Williams syn¬ 
drome. This is supported by observation of patients with 
partial deletions of the critical region affecting only the ELN 
and LIMKl genes. These persons have SVAS and visual- 
spatial cognitive deficiency but none of the other features of 
Williams syndrome. 

High-resolution banding and molecular genetic tech¬ 
niques have often led to a more precise specification of the 
critical chromosome region that must be deleted to cause a 
given syndrome. Wolf-Hirschhorn syndrome, for instance. 



FIG 6-19 A, Girl with Williams syndrome, illustrating typical facial features: broad forehead, short 
palpebral fissures, low nasal bridge, anteverted nostrils, long philtrum, full cheeks, and relatively 
large mouth with full lips. B, Angiogram illustrating supravalvular aortic stenosis (narrowing of 
the ascending aorta) (arrow). (Courtesy Dr. Mark Keating, Harvard University.) 
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TABLE 6-3 Microdeletion Syndromes 


SYNDROME 

CLINICAL FEATURES 

CHROMOSOMAL DELETION 

Prader-Willi 

Intellectual disability, short stature, obesity, hypotonia, 
characteristic facies, small feet 

15q11-13 

Angelman 

Intellectual disability, ataxia, uncontrolled laughter, seizures 

15q11-13 

Langer-Giedion 

Characteristic facies, sparse hair, exostosis, variable intellectual 
disability 

8q24 

Miller-Dieker 

Lissencephaly, characteristic facies, significant cognitive and 
psychomotor disability 

17p13.3 

Velocardiofacial/DiGeorge 

Characteristic facies, cleft palate, heart defects, poorly developed 
thymus 

22q11 

Smith-Magenis 

Intellectual disability, hyperactivity, dysmorphic features, self¬ 
destructive behavior 

17p11.2 

Williams 

Developmental disability, characteristic facies, supravalvular aortic 
stenosis 

7q1 

Aniridia, Wilms tumor 

Intellectual disability, predisposition to Wilms tumor, genital 
defects 

11p13 

Deletion 1p36 

Intellectual disability, seizures, hearing loss, heart defects, growth 
failure, distinctive facial features 

1p36 

Rubinstein-Taybi 

Intellectual disability, broad thumbs and great toes, characteristic 
facial features, vertebral and sternal abnormalities, heart defects 

16p13.3 

Alagille 

Neonatal jaundice, "butterfly" vertebrae, pulmonic valvular 
stenosis, characteristic facial features 

20p12 


*For most of these conditions, only some cases are caused by the listed microdeletion; other cases may be caused by single-gene 
mutations within the same region. 


can be produced by the deletion of only a very small intersti¬ 
tial segment of 4p, just 2 megabases from the telomere. In 
some instances, specific genes responsible for chromosome 
abnormality syndromes can be pinpointed. For example, 
persons with a deletion of lip can present with a series of 
features including Wilms tumor (a kidney tumor), aniridia 
(absence of the iris), genitourinary abnormalities,^ and 
“mental retardation” (sometimes termed the WAGR syn¬ 
drome). The genes responsible for the kidney tumor and for 
aniridia have now each been identified and cloned. Because 
the WAGR syndrome involves the deletion of a series of adja¬ 
cent genes, it is sometimes referred to as an example of a 
contiguous gene syndrome (Williams syndrome, discussed 
previously, is another example of a contiguous gene syn¬ 
drome). In addition to microdeletions, microduplications 
can also produce contiguous gene syndromes. 

Some of the microdeletion syndromes, such as the Prader- 
Willi and Williams syndromes, manifest deletions of a critical 
region of consistent size (e.g., 4 Mb for Prader-Willi syn¬ 
drome). Recent studies show that this is caused by the pres¬ 
ence of multiple repeated sequences, termed low-copy repeats 
(see Chapter 2), at the deletion boundaries. These repeated 
sequences promote unequal crossing over (see Chapter 5), 


^Because individuals with the WAGR syndrome also have gonado- 
blastomas (gonadal tumors), some authorities believe that the 
“G” should stand for gonadoblastoma rather than genitourinary 
abnormality. 


which then produces duplications and deletions of the region 
bounded by the repeat elements. 

Several additional examples of microdeletions are given in 
Table 6-3. Many of these conditions, including the Prader- 
Willi, Miller-Dieker, Williams, and velocardiofacial syn¬ 
dromes (Clinical Commentary 6-3), are now diagnosed using 
the FISH or aCGH techniques. 

Microdeletions are a subtype of chromosome 
deletion that can be observed only in banded 
chromosomes or, in some cases, using molec¬ 
ular genetic approaches. Syndromes caused 
by the deletion of a series of adjacent 
genes are sometimes called contiguous gene 
syndromes. 

Subtelomeric Rearrangements 

The regions near telomeres of chromosomes tend to have a 
high density of genes. Consequently, rearrangements of 
genetic material (e.g., deletions, duplications) in these regions 
often result in genetic disease. It is estimated that at least 5% 
of unexplained cases of intellectual disability are caused by 
subtelomeric rearrangements. The most common of these 
rearrangements is a deletion of several thousand bases of 
chromosome lp36, which is seen in approximately one in 
5000 live births. This condition, termed monosomy lp36 
syndrome, is associated with significant intellectual disability, 
developmental delay, seizures, hearing impairment, heart 
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CLINICAL COMMENTARY 6-3 

DiGeorge Sequence, Velocardiofacial Syndrome, and Microdeletions of Chromosome 22 


The DiGeorge sequence* is characterized by structural or func¬ 
tional defects of the thymus, conotruncal heart defects, hypo¬ 
parathyroidism (reduced parathyroid function), and secondary 
hypocalcemia (decreased serum calcium). This pattern of mal¬ 
formations is caused by an alteration of the embryonic migra¬ 
tion of neural crest cells to the developing structures of the 
neck. In the 1980s some children with DiGeorge sequence 
were found to have a deletion of part of the long arm of chro¬ 
mosome 22, often related to an unbalanced translocation 
between this and another chromosome. This led to the hypoth¬ 
esis that genes on chromosome 22 were responsible for this 
condition. 

Independent of this work, a condition called the velocardiofa¬ 
cial (VCF) syndrome, or Shprintzen syndrome, was described 
in the late 1970s. This syndrome includes palatal (velum) abnor¬ 
malities (including cleft palate), a characteristic facial appear¬ 
ance (Fig. 6-20), and. In some cases, heart malformations. In 
addition, these patients have learning disabilities and develop¬ 
mental delay. Later It was discovered that some persons with 
VCF have dysfunctional T cells (these cells mature In the 
thymus), and some have all features of DiGeorge sequence. 
This suggested that DiGeorge sequence was somehow related 
to VCF syndrome. 

The resemblance between the DiGeorge sequence and VCF 
syndrome led to the hypothesis that both conditions were 
caused by abnormalities on chromosome 22. High-resolution 
chromosome studies, including FISH, of patients with DiGeorge 
sequence and patients with VCF syndrome revealed small sub- 
microscopic deletions of chromosome 22 in both groups. 
These analyses also helped to narrow the critical region that 
causes both conditions. Approximately 80% to 90% of Infants 
with DiGeorge sequence have a 3-Mb microdeletion of the 
22q11.2 region, and most VCF patients have the same micro¬ 
deletion. In addition, 15% to 20% of persons with isolated 
conotruncal defects exhibit this deletion. Thus, most persons 
with either DiGeorge sequence or VCF syndrome have a micro¬ 
deletion of 22qll.2 and are collectively described as having 
22ql 1.2 deletion syndrome. With a prevalence of 1 in 3000 to 
4000 live births, this is the most common human microdeletion 
syndrome. 

About 90% of persons with 22ql 1.2 microdeletions lack the 
same 3-Mb region, containing about 35 genes. Another 8% 
have a smaller, 1.5-Mb deletion located within the 3-Mb region. 
No consistent phenotypic differences are seen between these 
two groups of patients. Both the 1.5- and 3-Mb regions are 


flanked by low-copy repeats that are thought to promote 
unequal crossing over, and thus deletion, in this region. One 
of the genes located In the deleted region, 76X7, encodes a 
transcription factor that helps to regulate the migration of 
neural crest cells and the development of facial structures, the 
thymus, the parathyroid, and the heart. In mouse models, Tbxl 
haploinsufficiency produces many of the features of DiGeorge 
sequence and VCF syndromes. 

This example Illustrates how cytogenetic studies can demon¬ 
strate potential biological relationships between genetic syn¬ 
dromes. Further studies are under way to characterize the 
individual genes in this region and how they contribute to the 
phenotypic variation seen in DiGeorge sequence and VCF 
syndromes. 



FIG 6-20 Face of a boy with 22qll deletion syndrome. 
Note the narrow, tall nasal root and nasal bridge and the 
somewhat smooth philtrum. (Courtesy Dr. Lynne M. Bird, 
Children's Hospital, San Diego.) 


*A sequence is defined as a series of alterations that are all due to a single, primary defect (see Chapter 15 for further discussion). In 
DiGeorge sequence, the primary defect is an abnormality in the migration of neural crest cells, and its secondary alterations are the 
anomalies of the thymus, parathyroids, and heart. 


defects, hypotonia, and characteristic facial features (Fig. 
6 - 21 ). 

Collections of probes have been designed so that FISH 
analysis of metaphase chromosomes can be undertaken to 
determine whether a subtelomeric deletion or duplication 
has occurred in a patient. In most clinical applications aCGH 


is carried out, hybridizing differentially labeled patient and 
control DNA samples to microarrays that contain probes cor¬ 
responding to all human subtelomeric regions. If a subtelo¬ 
meric region is duplicated or deleted, the patient’s DNA will 
show either excessive or deficient hybridization to the probe 
corresponding to that region. 
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FIG 6-21 Face of young boy with 1q36 deletion syndrome. 
Note the horizontal eyebrows, deep-set eyes, broad nasal 
root, and pointed chin. 


Subtelomeric rearrangements involve dele¬ 
tions or duplications of DNA in the gene-rich 
regions near telomeres. They can be detected 
by hybridizing specifically designed FISH 
probes to metaphase chromosomes or by 
comparative genomic hybridization of patient 
and control DNA to microarrays containing 
subtelomeric probes. 

Uniparental Disomy 

As discussed earlier, about 70% of cases of Prader-Willi syn¬ 
drome are caused by microdeletions. Most of the remaining 
cases involve uniparental disomy {di = “two”), a condition 
in which one parent has contributed two copies of a 
chromosome and the other parent has contributed no copies 
(Fig. 6-22). If the parent has contributed two copies of one 
homolog, the condition is termed isodisomy. If the parent 
has contributed one copy of each homolog, it is termed het¬ 
erodisomy. Isodisomy or heterodisomy of an imprinted 
chromosome can cause diseases such as Prader-Willi 



Father 



Mother 



Father 



Mother 




A 



Zygote with 
uniparental 
disomy 



B 



Chromosome loss 
produces cells with 
uniparental disomy 


FIG 6-22 Two mechanisms that can produce uniparental disomy. A, Parental nondisjunction 
produces a sperm cell with two copies of a specific chromosome, and maternal nondisjunction 
produces an ovum with no copies of the same chromosome. The resulting zygote has two copies 
of the father's chromosome and no copies of the mother's chromosome (in this example the 
father contributes both chromosomes, but it is also possible that the mother could contribute 
both chromosomes). B, Nondisjunction (in the mother, in this example) results in a trisomic 
zygote. Loss of the paternal chromosome during mitosis produces embryonic cells that have 
two copies of the mother's chromosome. 
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syndrome (i.e., the inheritance of two copies from the mother 
and none from the father means that the offspring receives 
no active paternal genes in the imprinted region; see Chapter 
5). Isodisomy can result in autosomal recessive disease in the 
offspring of a heterozygous parent if the parent contributes 
two copies of the chromosome homolog that contains the 
disease-causing mutation (see Fig. 6-22). The first docu¬ 
mented case of uniparental disomy in a human was seen in 
a patient with cystic fibrosis whose heterozygous carrier 
parent had transmitted two copies of chromosome 7 contain¬ 
ing a mutated CPTR gene, while the other parent transmitted 
neither copy of chromosome 7. 

Uniparental disomy can arise in a number of ways. A tri- 
somic conception can lose one of the extra chromosomes, 
resulting in an embryo that has two copies of the chromo¬ 
some contributed by one parent. Disomy can also result from 
the union of a gamete that contains two copies of a specific 
chromosome with a gamete that contains no copies of that 
chromosome (see Fig. 6-22). In the early embryo, cells with 
uniparental disomy can be produced by mitotic errors, such 
as chromosome loss with subsequent duplication of the 
homologous chromosome. In addition to the Prader-Willi 
and Angelman syndromes and cystic fibrosis, uniparental 
disomy has been seen in cases of Russell-Silver syndrome, 
hemophilia A (see Chapter 5), and Beckwith-Wiedemann 
syndrome (see Chapters 5 and 15). 

Duplications 

A partial trisomy, or duplication, of genetic material may be 
seen in the offspring of persons who carry a reciprocal trans¬ 
location. Duplications can also be caused by unequal cross¬ 
over during meiosis, as described for the X-linked color 
vision loci (see Chapter 5) and for Charcot-Marie-Tooth 
disease (see Chapter 3). Duplications tend to produce less- 
serious consequences than deletions, again illustrating the 
principle that a loss of genetic material is more serious than 
an excess of genetic material. 

Duplications can arise from unequal cross¬ 
over, or they can occur among the offspring of 
reciprocal translocation carriers. Duplications 
generally produce less-serious consequences 
than do deletions of the same region. 

Ring Chromosomes 

Deletions sometimes occur at both tips of a chromosome. 
The remaining chromosome ends can then fuse, forming a 
ring chromosome (Fig. 6-23). The karyotype of a female 
with a ring X chromosome is 46,X,r(X). If the ring chromo¬ 
some includes a centromere, it can often proceed through cell 
division, but its structure can create difficulties. Ring chro¬ 
mosomes are often lost, resulting in monosomy for the chro¬ 
mosome in at least some cells (i.e., mosaicism for the ring 
chromosome may be seen). Ring chromosomes have been 
described in at least one case for each of the human auto- 
somes; rings 14 and 22 represent two of the most common 
ring syndromes in humans. 



Normal 

FIG 6-23 Both tips of a chromosome can be lost, leaving 
sticky ends that attach to each other, forming a ring chromo¬ 
some. A chromosome 12 ring is shown here. 


Inversions 

An inversion is the result of two breaks on a chromosome 
followed by the reinsertion of the intervening fragment at its 
original site but in inverted order. Thus, a chromosome sym¬ 
bolized as ABCDEFG might become ABEDCFG after an 
inversion. If the inversion includes the centromere, it is called 
a pericentric inversion. Inversions that do not involve the 
centromere are termed paracentric inversions. 

Like reciprocal translocations, inversions are a balanced 
structural rearrangement. Consequently, they seldom produce 
disease in the inversion carrier (recall from Chapter 5, though, 
that an inversion that interrupts the factor VIII gene produces 
severe hemophilia A). Inversions can interfere with meiosis, 
however, producing chromosome abnormalities in the off¬ 
spring of inversion carriers. Because chromosomes must line 
up in perfect order during prophase 1, a chromosome with 
an inversion must form a loop to line up with its normal 
homolog (Fig. 6-24). Crossing over within this loop can result 
in duplications or deletions in the chromosomes of daughter 
cells. Thus, the offspring of persons who carry inversions 
often have chromosome deletions or duplications. It is esti¬ 
mated that about I in 1000 people carries an inversion and 
is therefore at risk for producing gametes with duplications 
or deletions. 

Figure 6-24 gives an example of a pericentric inversion on 
chromosome 8 (46,XX, inv[8]). About 5% of the offspring of 
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Parent Offspring 

FIG 6-24 A pericentric inversion in chromosome 8 causes the formation of a loop during the 
alignment of homologous chromosomes in meiosis. Crossing over in this loop can produce 
duplications or deletions of chromosome material in the resulting gamete. The offspring in the 
lower right received one of the recombinant 8 chromosomes from this parent. 


persons who carry this inversion receive a deletion or dupli¬ 
cation of the distal portion of 8q. This combination results 
in the recombinant 8 syndrome, which is characterized by 
cognitive disability, heart defects, seizures, and a characteris¬ 
tic facial appearance. 

Chromosome inversions a re relatively common 
structural abnormalities and may be either 
pericentric (including the centromere) or para¬ 
centric (not including the centromere). Parents 
with inversions are usually normal in pheno¬ 
type but can produce offspring with deletions 
or duplications. 

Isochromosomes 

Sometimes a chromosome divides along the axis perpendicu¬ 
lar to its usual axis of division (Fig. 6-25). The result is an 
isochromosome, a chromosome that has two copies of one 
arm and no copies of the other. Because the genetic material 
is substantially altered, isochromosomes of most autosomes 


are lethal. Most isochromosomes observed in live births 
involve the X chromosome, and babies with isochromosome 
Xq (46,X,i[Xq]) usually have features of Turner syndrome. 
Isochromosome 18q, which produces an extra copy of the 
long arm of chromosome 18, has been observed in infants 
with Edwards syndrome phenotype. Although most isochro¬ 
mosomes appear to be formed by faulty division, they can 
also be created by Robertsonian translocations of homolo¬ 
gous acrocentric chromosomes (e.g., a Robertsonian translo¬ 
cation of the two long arms of chromosome 21). 

CHROMOSOME ABNORMALITIES AND 
CLINICAL PHENOTYPES 

As we have seen, most autosomal aberrations induce consis¬ 
tent recognizable patterns of multiple malformations, 
minor anomalies, and phenotypes with variable degrees of 
developmental disability. Although the individual features are 
usually nonspecific (e.g., palmar creases can be seen in both 
Down syndrome and trisomy 18), the general pattern 
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FIG 6-25 Top, Normal chromosome division. Center, An iso¬ 
chromosome is formed when a chromosome divides along 
an axis perpendicular to its usual axis of division. This pro¬ 
duces one chromosome with only the short arms and another 
with only the long arms. Bottom, A normal X chromosome is 
compared with an isochromosome of Xq. 

of features is usually distinctive enough to permit a clinical 
diagnosis. This is especially true of the well-known chromo¬ 
some syndromes: Down syndrome, Edwards syndrome, Patau 
syndrome, and Turner syndrome. However, there is consider¬ 
able phenotypic variability even within these syndromes. No 
one patient has every feature; most congenital malformations 
(e.g., heart defects) are seen in only some affected individuals. 
This phenotypic variability, and the attendant potential for 



misdiagnosis, underscores the need to order a chromosome 
analysis (karyotype or aCGH) whenever clinical features 
suggest a chromosome abnormality. 

Usually the biological basis for phenotypic variability is 
unknown, although mechanisms such as mosaicism, which 
often leads to milder expression, are being uncovered. The 
basis of variable expression of chromosome syndromes will 
be better understood as the individual genes involved in these 
abnormalities are identified and characterized. 

In spite of the variability of chromosome syndromes, it is 
possible to make several generalizations: 

• Most chromosome abnormalities (especially those involv¬ 
ing autosomes) are associated with developmental delay in 
children and intellectual disability in older persons. This 
reflects the fact that a large number of human genes, 
perhaps one third or more of the total, participate in the 
development of the central nervous system. Therefore, a 
chromosome abnormality, which typically can affect hun¬ 
dreds of genes, is very likely to involve genes that affect 
nervous system development. 

• Most chromosome syndromes involve alterations of facial 
morphogenesis that produce characteristic facial features. 
For this reason, the patient often resembles other persons 
with the same disorder more than members of his or her 
own family. Usually the facial features and minor anoma¬ 
lies of the head and limbs are the best aids to diagnosis 
(see Chapter 15). 

• Growth delay (short stature and/or poor weight gain in 
infancy) is commonly seen in autosomal syndromes. 

• Congenital malformations, especially congenital heart 
defects, occur with increased frequency in most autosomal 
chromosome disorders. These defects occur in specific 
patterns. For example, AV canals and VSDs are common 
in children with Down syndrome. Other congenital heart 
defects, such as aortic coarctation or hypoplastic (under¬ 
developed) left ventricle, are seldom seen in these children 
but may be seen in those with Turner syndrome. 

The most common clinical indications for a chromosome 
analysis are a newborn with multiple congenital malforma¬ 
tions or a child with developmental delay. A summary of 
clinical situations in which a chromosome evaluation should 
be considered is given in Box 6-1. 



Chromosome abnormalities typically result in 
developmental delay, intellectual disability, 
characteristic facial features, and various types 
of congenital malformations. Despite some 
overlap of phenotypic features, many chromo¬ 
some abnormalities can be recognized by clini¬ 
cal examination. 


CANCER CYTOGENETICS 

Most of the chromosome abnormality syndromes discussed 
thus far are caused by errors that occur in the meiotic process 
leading to gamete formation. Chromosome rearrangements 
can also take place in somatic cells; these are responsible for 
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BOX 6-1 Indications for Performing 
Chromosome Analysis 


• Persons with a suspected recognizable chromosome syn¬ 
drome (e.g., Down syndrome) 

• Persons with an unrecognizable pattern of two or more 
malformations 

• Persons with ambiguous genitalia 

• Intellectual disability or developmental delay In children 
who have physical abnormalities 

• Parents and children of persons with chromosomal trans¬ 
locations, deletions, or duplications 

• Stillborn infants with malformation or with no recognizable 
reason for fetal death 

• Females with proportionate short stature and primary 
amenorrhea (consider Turner syndrome) 

• Males with small testes or significant gynecomastia (con¬ 
sider Klinefelter syndrome) 


a number of important cancers in humans. The first of these 
to be recognized was a chromosome alteration seen consis¬ 
tently in patients with chronic myelogenous leukemia (CML). 
Initially, it was suggested that the chromosome alteration was 
a deletion of the long arm of either chromosome 21 or chro¬ 
mosome 22. With the subsequent development of chromo¬ 
some banding techniques, the abnormality was identified as 
a reciprocal translocation between chromosomes 9 and 22. 
The Philadelphia chromosome, as this translocation is com¬ 
monly known, consists of a translocation of most of chromo¬ 
some 22 onto the long arm of chromosome 9. A small distal 
portion of 9q in turn is translocated to chromosome 22. The 
net effect is a smaller chromosome 22, which explains why 
the Philadelphia chromosome was initially thought to be a 
deletion. This translocation (Fig. 6-26) is seen in most cases 
of CML. 

Much has been learned about the effects of this transloca¬ 
tion by isolating the genes that are located near the transloca¬ 
tion breakpoints (i.e., the locations on the chromosomes at 
which the breaks occur preceding translocation). A proto¬ 
oncogene (see Chapter 11) called ABL is moved from its 
normal position on 9q to 22q. This alters the ABL gene 
product, causing increased tyrosine kinase activity, which 
leads to malignancy in hematopoietic cells (i.e., cells that 
form blood cells such as lymphocytes). Drugs have now been 
developed to inhibit the tyrosine kinase encoded by this gene, 
offering much more effective treatment for CML. 

A second example of a translocation that produces cancer 
is given by Burkitt lymphoma, a childhood jaw tumor. In this 
case, a reciprocal translocation involving chromosomes 8 and 
14 moves the MYC proto-oncogene from 8q24 to 14q32, near 
the immunoglobulin heavy chain loci (see Chapter 9). Tran¬ 
scription regulation sequences near the immunoglobulin 
genes then activate MYC, causing malignancies to form. 

More than 100 different rearrangements, involving nearly 
every chromosome, have been observed in more than 40 dif¬ 
ferent types of cancer. Some of these are summarized in Table 
6-4. Sometimes these translocations are identified using spec- 



Normal der(22) 

22 

FIG 6-26 Reciprocal translocation between chromosome 22 
and the long arm of chromosome 9 (the Philadelphia chromo¬ 
some). The occurrence of this translocation in hematopoietic 
cells can produce chronic myelogenous leukemia. 


TABLE 6-4 Specific Cytogenetic 

Changes Observed in Selected 

Leukemias and Solid Tumors 

TYPE 

MOST COMMON 

CHROMOSOME 

ABERRATION 

Leukemias 

Chronic myelogenous leukemia 

t(9;22)(q34;q11) 

Acute myeloblastic leukemia 

t(8;21)(q22;q22) 

Acute promyelocytic leukemia 

t(15:17)(q22:q11-12) 

Acute lymphocytic leukemia 

t(12:21)(pl3:q22) 

Solid Tumors 

Burkitt lymphoma 

t(8;14)(q24;q32) 

Ewing sarcoma 

t(11:22)(q24:q12) 

Meningioma 

Monosomy 22 

Retinoblastoma 

del(13)(q14) 

Wilms tumor 

del(11)(p13) 

Neuroblastoma 

N-MYC amplification 

Breast cancer 

HER2/NEU amplification 


tral karyotypes. In many cases, identification of the chromo¬ 
some rearrangement leads to a more accurate prognosis and 
better therapy. Hence, the cytogenetic evaluation of bone 
marrow cells from leukemia patients is a routine part of diag¬ 
nosis. Furthermore, identification and characterization of the 
genes that are altered in translocation syndromes are leading 
to a better understanding of carcinogenesis in general. 
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Balanced translocations in somatic cells can 
sometimes cause malignancies by interrupting 
or altering genes ortheir regulatory sequences. 

CHROMOSOME INSTABILITY SYNDROMES 

Several autosomal recessive disease conditions exhibit an 
increased incidence of chromosome breaks under specific 
laboratory conditions. These conditions, which are 
termed chromosome instability syndromes, include ataxia- 
telangiectasia, Bloom syndrome, Fanconi anemia, and xero¬ 
derma pigmentosum (see Chapter 2). Among patients with 
Fanconi anemia, the frequency of breaks can be increased 


further if the chromosomes are exposed to certain alkylating 
agents. Patients with Bloom syndrome also have a high inci¬ 
dence of somatic cell sister chromatid exchange (exchange of 
chromosome material between sister chromatids; see Chapter 
2). Each of these syndromes is associated with a significant 
increase in cancer risk. All are thought to be the result of 
faulty DNA replication or repair, as discussed in Chapter 2. 

The chromosome instability syndromes all 
involve increased frequencies of chromosome 
breakage and an increased risk of malignancy. 
All are associated with defects in DNA replica¬ 
tion or repair. 




I STUDY QUESTIONS 


1. Distinguish among haploidy, diploidy, polyploidy, 
euploidy, and aneuploidy. 

2. Explain the uses and relative advantages of FISH, spectral 
karyotyping, and comparative genomic hybridization 
(CGH). 

3. Describe three ways in which triploidy could arise. 

4. Studies of karyotypes obtained by prenatal diagnosis at 
10 weeks’ gestation (chorionic villus sampling; see 
Chapter 13) reveal prevalence rates of chromosome 
abnormalities that differ from those obtained in karyo¬ 
types at 16 weeks’ gestation (amniocentesis; see Chapter 
13). Explain this. 

5. Even though conditions such as Down syndrome and 
Edwards syndrome can usually be diagnosed accurately 
by clinical examination alone, a karyotype is always rec¬ 
ommended. Why? 

6. Rank the following, from lowest to highest, in terms of 
the risk of producing a child with Down syndrome: 

45-year-old woman with no previous family history 
of Down syndrome 


25-year-old woman who has had one previous child 
with Down syndrome 

25-year-old male carrier of a 21/14 Robertsonian 
translocation 

25-year-old female carrier of a 21/14 Robertsonian 
translocation 

7. Females with the 49,XXXXX karyotype have been 
reported. Explain how this karyotype could occur. 

8. A man with hemophilia A and a normal woman produce 
a child with Turner syndrome (45,X). The child has 
normal factor VIII activity. In which parent did the 
meiotic error occur? 

9. A cytogenetics laboratory reports a karyotype of 
46,XY,del(8)(pll) for one patient and a karyotype of 
46,XY,dup(8)(pll) for another patient. Based on this 
information alone, which patient is expected to be more 
severely affected? 

10. Why do translocations in somatic cells sometimes lead to 
cancer? 
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CHAPTER 


Biochemical Genetics: 
Disorders of Metabolism 


Each of us is composed of a large number of complex mol¬ 
ecules that are hierarchically arranged in space to form cells, 
tissues, organs, and ultimately a complete human being. 
These molecules are constructed from individual elements 
that may be synthesized endogenously or obtained from the 
environment. Once created, these molecules are not static. 
In fact, they are perpetually being synthesized, degraded, 
excreted, and sometimes recycled in a tightly choreographed 
metabolic dance. 

Each metabolic process consists of a sequence of catalytic 
steps mediated by enzymes encoded by genes. Usually, these 
genes are replicated with high fidelity, and enzymatic systems 
continue to work effectively from generation to generation. 
Occasionally, mutations reduce the efficiency of encoded 
enzymes to a level at which normal metabolism cannot occur. 
Such variants of metabolism were recognized by Sir Archibald 
Garrod at the beginning of the 20th century, based partly 
on his studies of alkaptonuria (AKU). Garrod recognized 
that these variants illustrated “chemical individualities,” 
and he called these disorders “inborn errors of metabolism,” 
thus setting the cornerstone for contemporary biochemical 
genetics. 

AKU is a rare disorder in which homogentisic acid (HGA), 
an intermediate metabolite in phenylalanine and tyrosine 
metabolism (Fig. 7-1), is excreted in large quantities in urine, 
causing it to darken on standing. Hence, AKU was classically 
referred to as “black urine disease.” Additionally, an oxida¬ 
tion product of HGA is directly deposited in connective 
tissues, resulting in abnormal pigmentation and debilitating 
arthritis. 

Garrod proposed in 1902 that AKU was caused by a defi¬ 
ciency of the enzyme that normally splits the aromatic ring 
of HGA. Fifty years later, it was established that AKU is pro¬ 
duced by a failure to synthesize homogentisate 1,2-dioxygenase 
(HGO). However, it was not until 1996 that scientists identi¬ 
fied the gene that is altered in AKU, based on homology to a 
gene encoding an HGO enzyme that was isolated from a 
fungus species. The coding region of HGO comprises 14 
exons distributed over 54 kb of DNA. Many of the mutations 
identified in HGO encode proteins that show no HGO activ¬ 
ity when expressed in vitro. This indicates that AKU is caused 
by a loss-of-function mutation, confirming the hypothesis 
put forth by Garrod more than a century ago. 
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Almost all biochemical processes of human 
metabolism are catalyzed by enzymes. Varia¬ 
tions of enzymatic activity among humans are 
common, and a minority of these variants 
causes disease. These concepts were intro¬ 
duced by Archibald Garrod and exemplified by 
his studies of alkaptonuria. 

VARIANTS OF METABOLISM 
Prevalence of Metabolic Disease 

Hundreds of different inborn errors of metabolism have been 
described to date, and most of these are rare. Taken together, 
however, metabolic disorders account for a substantial per¬ 
centage of the morbidity and mortality directly attributable 
to genetic disease (Table 7-1). One survey conservatively esti¬ 
mated that the incidence of metabolic disorders is approxi¬ 
mately 1 in every 2500 births, or 10% of all monogenic 
conditions in children. Moreover, we are beginning to under¬ 
stand that different alleles of genes that encode enzymes can 
alter one’s risk for many common diseases, such as diabetes, 
heart disease, stroke, and cancer. 

The diagnosis of a metabolic disorder can be challenging 
(Clinical Commentary 7-1); thus, the morbidity associated 
with metabolic defects is probably underestimated. In the 
1970s, an often fatal acute metabolic encephalopathy called 
Reye syndrome was diagnosed in many children. In the fol¬ 
lowing decades, we learned that some children with an 
encephalopathy indistinguishable from Reye syndrome had a 
urea cycle defect that produced hyperammonemia (increased 
levels of circulating ammonia) and death. Recognition of 
Reye syndrome as a phenocopy of a urea cycle defect is 
important because, in addition to supportive care,* the chil¬ 
dren can now receive direct treatment for the urea cycle 
defects. Similarly, on postmortem examination, some chil¬ 
dren who have died from sudden infant death syndrome 
(SIDS) have been found to have a defect of fatty acid metabo¬ 
lism. These are also treatable disorders, and life-threatening 
episodes can be avoided with appropriate care. 


^Supportive care is care that supports the elementary functions of the 
body, such as maintenance of fluid balance, oxygenation, and blood 
pressure, but is not aimed at treating the disease process directly. 
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FIG 7-1 Major pathway of phenylalanine metabolism. Differ¬ 
ent enzymatic defects in this pathway cause classic phenyl¬ 
ketonuria (V, tyrosinase-negative oculocutaneous albinism 
(2), alkaptonuria (3), and tyrosinemia type 1 (4). 

Although individual metabolic disorders are 
rare, their overall direct and indirect contribu¬ 
tion to morbidity and mortality is substantial. 

Inheritance of Metabolic Defects 

Most metabolic disorders are inherited in an autosomal 
recessive pattern: Only individuals having two mutant alleles 
are affected. Although a mutant allele produces reduced or 
no enzyme activity (loss of function), it usually does not alter 
the health of a heterozygous carrier. Because many of the 
genes encoding disease-related enzymes have been identified 
and their mutations characterized, carrier testing and prena¬ 
tal diagnosis are available for many metabolic disorders. 
However, testing samples of dried blood for elevated levels of 
metabolites in the newborn period (e.g., for phenylketonuria 
and galactosemia; see Chapter 13) remains the most com¬ 
monly used population-based screening test for metabolic 
disorders. Expanded newborn screening that tests for dozens 
of different disorders by checking for the presence of abnor¬ 
mal metabolites in blood is common. As the technology for 
rapid and efficient DNA testing of mutant alleles progresses, 
population-based screening for additional disorders is likely 
to be incorporated. 


Most inborn errors of metabolism are inher¬ 
ited in an autosomal recessive pattern. 
The carrier state usually is not associated 
with morbidity. Carrier and diagnostic testing 
are becoming widely available for many 
disorders. 

Types of Metabolic Processes 

Metabolic disorders have been classified in many different 
ways, based on the pathological effects of the pathway blocked 
(e.g., absence of end product, accumulation of substrate); 
different functional classes of proteins (e.g., receptors, hor¬ 
mones); associated cofactors (e.g., metals, vitamins); and 
pathways affected (e.g., glycolysis, citric acid cycle). Each of 
these has advantages and disadvantages, and none of them 
encompasses all metabolic disorders. However, the classifica¬ 
tion that most completely integrates our knowledge of cell 
biology, physiology, and pathology with metabolic disorders 
categorizes defects of metabolism by the types of processes 
that are disturbed. 

DEFECTS OF METABOLIC PROCESSES 

Almost all biochemical reactions in the human body are con¬ 
trolled by enzymes, which act as catalysts. The catalytic prop¬ 
erties of enzymes typically increase reaction rates by more 
than a million-fold. These reactions mediate the synthesis, 
transfer, use, and degradation of biomolecules to build and 
maintain the internal structures of cells, tissues, and organs. 
Biomolecules can be categorized into four primary groups: 
nucleic acids, proteins, carbohydrates, and lipids. The major 
metabolic pathways that metabolize these molecules include 
glycolysis, citric acid cycle, pentose phosphate shunt, gluco- 
neogenesis, glycogen and fatty acid synthesis and storage, 
degradative pathways, energy production, and transport 
systems. We now discuss how defects in each of these meta¬ 
bolic pathways can cause human disease. 

Carbohydrate Metabolism 

Because of the many different applications that they serve in 
all organisms, carbohydrates are the most abundant organic 
substance on Earth. Carbohydrates function as substrates for 
energy production and storage, as intermediates of metabolic 
pathways, and as the structural framework of DNA and RNA. 
Consequently, carbohydrates account for a major portion of 
the human diet and are metabolized into three principal 
monosaccharides: glucose, galactose, and fructose. Galactose 
and fructose are converted to glucose before glycolysis. The 
failure to effectively use these sugars accounts for the majority 
of the inborn errors of human carbohydrate metabolism. 

Galactose 

The most common monogenic disorder of carbohydrate 
metabolism, transferase deficiency galactosemia (classic 
galactosemia), affects 1 in every 50,000 newborns. It is 
most commonly caused by mutations in the gene encoding 
galactose-1-phosphate uridyl transferase (GAL-l-P uridyl 
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TABLE 7-1 Disorders of Metabolism 






CHROMOSOMAL 

NAME 

PREVALENCE 

MUTANT GENE PRODUCT 

LOCATION 

Carbohydrate Disorders 

Classic galactosemia 

1/35,000 to 1/60,000 

Galactose-1-phosphate uridyl 
transferase 

9p13 

Hereditary fructose intolerance 

1/20,000 

Fructose 1,6-bisphosphate aldolase 

9q13-q32 

Fructosuria 

=1/100,000 

Fructokinase 

2p23 

Hypolactasia (adult) 

Common 

Lactase 

2q21 

Diabetes mellitus type 1 

1/400 (Europeans) 

Multiple 

Polygenic 

Diabetes mellitus type 2 

1/20 

Multiple 

Polygenic 

Maturity-onset diabetes of the 
young (MODY) 

=1/400 

Multiple 

Multiple loci 

Amino Acid Disorders 

Phenylketonuria 

1/10,000 

Phenylalanine hydroxylase 

12q24 

Tyrosinemia (type 1) 

1/100,000 

Fumarylacetoacetate hydrolase 

15q23-25 

Maple syrup urine disease 

1/180,000 

Branched-chain a-ketoacid 
dehydrogenase (multiple subunits) 

Multiple loci 

Alkaptonuria 

1/250,000 

Homogentisic acid oxidase 

3q2 

Homocystinuria 

1/340,000 

Cystathionine p-synthase 

21 q2 

Oculocutaneous albinism 

1/35,000 

Tyrosinase 

11q 

Cystinosis 

1/100,000 

CTNS 

17p13 

Cystinuria 

1/7000 

SLC3A1 (type 1) 

SLC7A9 (types II & III) 

2p 

19q13 

Lipid Disorders 

MCAD 

1/1,000 to 1/15,000 

Medium-chain acyl-CoA 
dehydrogenase 

1p31 

VLCAD 

1/30,000 

Very Long-chain acyl-CoA 
dehydrogenase 

17p13.1 

SLO 

1/10,000 

A7-sterol reductase 

11q12-q13 

Organic Acid Disorders 

Methylmalonic acidemia 

1/20,000 

Methylmalonyl-CoA mutase 

6p 

Propionic acidemia 

Rare 

Propionyl-CoA carboxylase 

13q32; 3q 

Urea Cycle Defects 

Ornithine transcarbamylase 
deficiency 

1/60,000 

Ornithine carbamyl transferase 

Xp21 

Carbamyl phosphate synthetase 
deficiency 

1/300,000 

Carbamyl phosphate synthetase 1 

2p 

Argininosuccinic acid synthetase 
deficiency 

1/250,000 

Argininosuccinic acid synthetase 

9q34 

Energy Production Defects 

Cytochrome c oxidase deficiency 

Rare 

Cytochrome oxidase peptides 

Multiple loci 

Pyruvate carboxylase deficiency 

Rare 

Pyruvate carboxylase 

11q 

Pyruvate dehydrogenase complex 
(El) deficiency 

Rare 

Pyruvate decarboxylase, Eia 

Xp22 

NADH-CoO reductase deficiency 

Rare 

Multiple nuclear genes 

Multiple loci 

Heavy Metal Transport Defects 

Wilson disease 

1/50,000 

ATP7B 

13q14 

Menkes disease 

1/250,000 

ATP7A 

Xq13 

Hemochromatosis 

1/200 to 1/400 (Europeans) 

HFE 

6p21 

Acrodermatitis enteropathica 

Rare 

SLC39A4 

8q24 


MCAD, Medium-chain acyl-CoA dehydrogenase: SLO, Smith-Lemli-Opitz syndrome; VLCAD, very long-chain acyl-CoA dehydrogenase. 
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CLINICAL COMMENTARY 7-1 
Diagnosis of a Metabolic Disorder 

The presentations of persons with inborn errors of metabolism 
are highly variable. During gestation, the maternal-placental 
unit usually provides essential nutrients and prevents the accu¬ 
mulation of toxic substrates. Thus, a fetus is infrequently symp¬ 
tomatic. However, after birth, persons with metabolic disorders 
can present at ages ranging from the first day of life to adult¬ 
hood. The presentation may be precipitous and characterized 
by dramatic alterations in homeostasis and even death. In 
contrast, the disorder may be insidious, with only subtle 
changes in function over long periods. For most metabolic 
disorders, the presymptomatic period and onset of symptoms 
lie somewhere between these two extremes. The following 
case illustrates this point. 

Anthony is a 9-month-old Latin American boy who comes to 
the emergency department accompanied by his parents. His 
parents complain that he has been irritable and vomiting for 
the last 36 hours, and over the past 12 hours he has become 
increasingly sleepy. They sought medical attention because it 
was difficult to awaken Anthony to breast-feed him. Anthony's 
medical history is unremarkable. He has a healthy 8-year-old 
sister and had a brother who died in his crib at 7 months of 
age. An investigation of the brother's death and an autopsy 
were performed. The findings were consistent with sudden 
infant death syndrome (SIDS). 

Anthony is hospitalized and is noted to be hypoglycemic (low 
serum glucose level), slightly acidemic (serum pH < 7.4), and 
hyperammonemic (elevated plasma ammonia). Intravenous 
infusion of glucose transiently improves his level of alertness, 
but he becomes comatose and dies 5 days later. An autopsy 
reveals marked cerebral edema (swelling of the brain) and fatty 
infiltration of the liver consistent with a diagnosis of Reye 
syndrome. Anthony's mother is concerned that the boys' 
deaths are related to each other, especially since she is preg¬ 
nant again. She is counseled that the causes of death are 
unrelated and neither disorder is likely to recur in her family. 


One year later her 6-month-old daughter, Maria, is hospital¬ 
ized for the third time because of lethargy and weakness. 
Laboratory studies reveal moderate hypoglycemia, hyperam¬ 
monemia, and ketonuria (ketones in the urine). Additional 
studies, including measurement of urine organic acids,* serum 
amino acids, and free and esterified plasma carnitines, suggest 
that Maria has a defect of fatty acid oxidation. Therapy is 
initiated with intravenous glucose, oral carnitine, and the 
restriction of fats to no more than 20% of her caloric 
requirements. More specific biochemical and molecular 
studies confirm that Maria has medium-chain acyl-CoA dehy¬ 
drogenase (MCAD) deficiency. Molecular studies from pre¬ 
served tissues that had been collected at autopsy from 
Maria's deceased brothers indicate that they also had MCAD 
deficiency. Maria's asymptomatic older sister is similarly 
affected. Both girls are healthy 2 years later, eating a low-fat 
diet and using a carnitine supplement. They have a new baby 
brother who underwent prenatal testing for MCAD and was 
found to be unaffected. 

The disparate presentations of MCAD deficiency in this 
family (sudden death, acute illness, chronic illness, and 
asymptomatic) illustrate the phenotypic variability often 
observed in persons with inborn errors of metabolism, even 
those sharing an identical mutation. Thus, there might not be 
a disease-specific pattern of symptoms and findings. Often 
it is the heightened index of suspicion of care providers that 
leads to the testing necessary to identify a metabolic disorder. 
Supportive therapy can be lifesaving and should be initiated 
before making a diagnosis. Nevertheless, it is imperative that 
prudent attempts be made to make a specific diagnosis, 
because it can have important implications for the family (e.g., 
prenatal testing, presymptomatic therapy). The treatment of 
MCAD deficiency is completely effective in preventing early 
death from the toxic effects of accumulated fatty acid 
intermediates. 


*Organic acids are carbon-based acids that are products of intermediate metabolism and normally do not accumulate in plasma or urine 
beyond the buffering capacities of these fluids. 


transferase) (Fig. 7-2). This gene is composed of 11 exons 
distributed across 4 kb of DNA, and approximately 70% of 
galactosemia-causing alleles in people of Western European 
origin have a single missense mutation in exon 6, p.Q188R. As 
a result of diminished GAL-l-P uridyl transferase activity, 
affected persons cannot effectively convert galactose to 
glucose; consequently, galactose is alternatively metabolized 
to galactitol and galactonate (see Fig. 7-2). Although galactose 
and its metabolites accumulate in many tissues, the patho¬ 
physiology of classic galactosemia is not weU understood. 

Classic galactosemia typically manifests in the newborn 
period with poor sucking, failure to thrive, and jaundice. If 
galactosemia is left untreated, sepsis, hyperammonemia, and 
shock leading to death usually follow. Cataracts (opacifica¬ 
tion of the lens of the eye) are found in about 10% of infants. 
Newborn screening for galactosemia is widespread, and 
most affected persons are identified as they begin to develop 
symptoms. Early identification affords prompt treatment, 
which consists largely of eliminating dietary galactose. This 


substantially reduces the morbidity associated with the acute 
effects of elevated levels of galactose metabolites. Long-term 
disabilities may include poor growth, developmental delay,’ 
speech problems, and ovarian failure in females. These 
sequelae are thought to be caused by endogenous production 
of galactose. The effects of prospective dietary therapy on the 
prevalence of these long-term sequelae are less clear. Early 
studies suggested that there was no effect, but as more longi¬ 
tudinal data become available, it appears that patients treated 
early in life with dietary therapy may have a normal cognitive 
outcome. 

Galactosemia can also be caused by mutations in the genes 
encoding galactokinase or uridine diphosphate galactose- 
4-epimerase (UDP-galactose-4-epimerase) (see Fig. 7-2). 


’^Developmental delay is the delayed attainment of age-appropriate 
motor, speech, and/or cognitive milestones; the outcomes of 
developmental delay range from normal to profound intellectual 
disability. 
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Deficiency of galactokinase is associated with the formation 
of cataracts but does not cause growth failure, intellectual 
disability, or hepatic disease. Dietary restriction of galactose 
is also the treatment for galactokinase deficiency. Deficiency 
of UDP-galactose-4-epimerase can be limited to red blood 
cells and leukocytes, causing no ill effects, or it can be sys¬ 
temic and produce symptoms similar to those of classic 
galactosemia. Treatment is aimed at reducing the dietary 


Aldolase 



intake of galactose, but not as severely as in patients with 
classic galactosemia, because some galactose must be pro¬ 
vided to produce UDP-galactose for the synthesis of some 
complex carbohydrates. 



Galactosemia is one of the most common 
inherited disorders of carbohydrate metabo¬ 
lism. Newborn screening for galactosemia is 
widespread. Early identification allows prompt 
treatment, which consists largely of eliminat¬ 
ing dietary galactose. Mutations in the gene 
that encodes GAL-1-P uridyl transferase are 
the most common cause of galactosemia. 


Galactokinase 
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Galactose 
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Glucose 1- 
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phosphate 
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FIG 7-2 Major pathways of galactose metabolism. The most 
common enzymatic abnormality producing galactosemia is 
a defect of GAL-1-P uridyl transferase. Defects of galacto¬ 
kinase or of UDP-galactose 4-epimerase are much less 
common causes of galactosemia. GAL, Galactose; UDP, 
uridine diphosphate. 


Fructose 

Three autosomal recessive defects of fructose metabolism 
have been described. The most common is caused by muta¬ 
tions in the gene encoding hepatic fructokinase. This enzyme 
catalyzes the first step in the metabolism of dietary fructose, 
the conversion to fructose-1-phosphate (Fig. 7-3). Inactiva¬ 
tion of hepatic fructokinase results in asymptomatic fructos- 
uria (presence of fructose in the urine). 

In contrast, hereditary fructose intolerance (HFI) results 
in poor feeding, failure to thrive, hepatic and renal insuffi¬ 
ciency, and death. HFI is caused by a deficiency of fructose 
1,6-bisphosphate aldolase in the liver, kidney cortex, and 
small intestine. Infants and adults with HFI are asymptomatic 
unless they ingest fructose or sucrose (a sugar composed of 



FIG 7-3 Summary of glucose, fructose, and glycogen metabolism. Enzymatic defects in 
this pathway cause hyperglycemia (V, Von Gierke disease (2), fructosuria (3), hereditary 
fructose intolerance (4), Cori disease (5), Anderson disease (6), Tarui disease (7), and fructose 
1,6-bisphosphatase (FBPase) deficiency (8). UDP, Uridine diphosphate. 

















































































CHAPTER 7 Biochemical Genetics: Disorders of Metabolism 


137 


fructose and glucose). Infants who are breast-fed become 
symptomatic after weaning, when fruits and vegetables are 
added to their diet. Affected infants may survive into child¬ 
hood because they avoid foods they consider noxious, thereby 
self-limiting their intake of fructose. The prevalence of HFI 
may be as high as 1 in 20,000 births, and, since the cloning 
of the gene encoding fructose-1-phosphate aldolase, differ¬ 
ences in the geographic distribution of mutant alleles have 
been found. 

Deficiency of hepatic fructose 1,6-bisphosphatase (FBPase) 
causes impaired gluconeogenesis, hypoglycemia (reduced 
level of circulating glucose), and severe metabolic acidemia 
(serum pH <7.4). Affected infants commonly present for 
treatment shortly after birth, although cases diagnosed later 
in childhood have been reported. If patients are adequately 
supported beyond childhood, growth and development 
appear to be normal. A handful of mutations have been found 
in the gene encoding FBPase, some of which encode mutant 
proteins that are inactive in vitro. 

Asymptomatic deficiency of fructokinase is the 
most common defect of fructose metabolism. 
Hereditary fructose intolerance is less preva¬ 
lent but is associated with much more severe 
problems. 

Glucose 

Abnormalities of glucose metabolism are the most common 
errors of carbohydrate metabolism. However, the causes of 
these disorders are heterogeneous and include both environ¬ 
mental and genetic factors. Historically, disorders associated 
with elevated levels of plasma glucose (hyperglycemia) have 
been classified into three categories: type 1 diabetes mellitus 
(TIDM), type 2 diabetes mellitus (T2DM), and maturity- 
onset diabetes of the young (MODY), of which there are 
many subtypes. TIDM is associated with reduced or absent 
levels of plasma insulin and usually manifests in childhood. 
DM type 2 is characterized by insulin resistance and, most 
commonly, adult onset. A more detailed discussion of TIDM 
and T2DM can be found in Chapter 12. 

Substantial advances in understanding of the pathophysi¬ 
ology of common diabetes have been achieved by identifying 
mutations that cause rare monogenic forms of hyperglyce¬ 
mia. Mutations in the insulin receptor gene have been associ¬ 
ated with a disorder characterized by insulin resistance and 
acanthosis nigricans (hypertrophic skin with a corrugated 
appearance). These mutations can decrease the number of 
insulin receptors on a cell’s surface, or they can decrease the 
insulin-binding activity level or the insulin-stimulated tyro¬ 
sine kinase activity level. Mutations in mitochondrial DNA 
and in the genes encoding insulin and glucokinase have also 
been associated with hyperglycemic disorders. 

Studies of rare, monogenic forms of diabetes 
define the pathways that may be disturbed 
in the more common forms of diabetes 
mellitus. 


Lactose 

The ability to metabolize lactose (a sugar composed of glucose 
and galactose) depends, in part, on the activity of an intesti¬ 
nal brush-border enzyme called lactase-phlorizin hydrolase 
(LPH). In most mammals, LPH activity diminishes after 
infants are weaned from maternal milk. However, the persis¬ 
tence of intestinal LPH activity is a common autosomal reces¬ 
sive trait in human populations, with an incidence ranging 
from 5% to 90%. The geographic distribution of lactase per¬ 
sistence is concordant with areas of high milk intake, such as 
northwestern Europe and certain parts of Africa. The persis¬ 
tent ability for adults to use dairy products as a source of 
vitamin D had a selective advantage in these populations. 

Lactase nonpersistence (i.e., adult-type hypolactasia or 
lactose intolerance) is common in most tropical and sub¬ 
tropical countries. Persons with lactase nonpersistence can 
experience nausea, bloating, and diarrhea after ingesting 
lactose. Thus, in many regions where reduced lactase activity 
is prevalent, the lactose in dairy products is often partially 
metabolized (e.g., by lactobacilli in the preparation of yogurt) 
before consumption. The role of lactase nonpersistence as a 
cause of abdominal pain and symptoms of irritable bowel 
syndrome is controversial. 

LPH is encoded by the lactase gene (LCT) on chromosome 
2. In European populations, adult LPH expression is regu¬ 
lated by a polymorphism located in an upstream gene named 
minichromosome maintenance 6 (MCM6). However, in sub- 
Saharan African, Central Asian, and Arabian peninsular 
populations in which lactase persistence is common, this 
polymorphism is found at low frequency or is absent. Lactase 
persistence in these populations is caused by different poly¬ 
morphisms that appear to increase transcription of LCT. 
These polymorphisms appear to have arisen relatively recently 
in human evolution and have increased in incidence as a 
result of natural selection independently in populations in 
Europe and Africa. In each case, the selective force appears to 
have been a local adaptive response to the higher fitness 
afforded by the ability to consume dairy products. 

Mutations that abolish lactase activity altogether cause 
congenital lactase deficiency and produce severe diarrhea and 
malnutrition in infancy. Such mutations are very rare. 

Glycogen 

Carbohydrates are most commonly stored as glycogen in 
humans. Consequently, enzyme deficiencies that lead to 
impaired synthesis or degradation of glycogen are also con¬ 
sidered disorders of carbohydrate metabolism. Defects of 
each of the proteins involved in glycogen metabolism have 
been identified (Table 7-2). These cause different forms of 
glycogen storage disorders and are classified numerically 
according to the chronological order in which their enzy¬ 
matic basis was described. The two organs most severely 
affected by glycogen storage disorders are the liver and 
skeletal muscle. Glycogen storage disorders that affect the 
liver typically cause hepatomegaly (enlarged liver) and hypo¬ 
glycemia (low plasma glucose level). Glycogen storage disor¬ 
ders that affect skeletal muscle cause exercise intolerance. 
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TABLE 7-2 

Glycogen Storage Disorders 


TYPE 

DEFECT 

MAJOR AFFECTED TISSUES 

la (Von Gierke) 

Glucose-6-phosphatase 

Liver, kidney, intestine 

lb 

Microsomal glucose-6-phosphate transport 

Liver, kidney, intestine, neutrophils 

II (Pompe) 

Lysosomal acid fi-glucosidase 

Muscle, heart 

Ilia (Corl) 

Glycogen debranching enzyme 

Liver, muscle 

lllb 

Glycogen debranching enzyme 

Liver 

IV (Anderson) 

Branching enzyme 

Liver, muscle 

V (McArdle) 

Muscle phosphorylase 

Muscle 

VI (Hers) 

Liver phosphorylase 

Liver 

VII (Tarul) 

Muscle phosphofructokinase 

Muscle 


progressive weakness, and cramping. Some glycogen storage 
disorders, such as Pompe disease, can also affect cardiac 
muscle, causing cardiomyopathy and early death. Treatment 
of some glycogen storage disorders by enzyme replacement 
(e.g., giving active forms of the enzyme intravenously) can 
improve, and in some cases prevent, symptoms and therefore 
preserve function and prevent early death. 

Each disorder of glycogen metabolism is 
uncommon, yet taken together these defects 
account for substantial morbidity. Early inter¬ 
vention can prevent severe disabilities and 
early death. 

Amino Acid Metabolism 

Proteins play the most diverse roles of the major biomole¬ 
cules (e.g., providing mechanical support, coordinating 
immune responses, generating motion). Indeed, nearly all 
known enzymes are proteins. The fundamental structural 
units of proteins are amino acids. Some amino acids can be 
synthesized endogenously (nonessential), and others must be 
obtained from the environment (essential). Many defects of 
the metabolism of amino acids have been described. 

Phenylalanine 

Defects in the metabolism of phenylalanine (an essential 
amino acid) cause the hyperphenylalaninemias, some of the 
most widely studied of all inborn errors of metabolism. These 
disorders are caused by mutations in genes that encode com¬ 
ponents of the phenylalanine hydroxylation pathway (see Fig. 
7-1). Elevated levels of plasma phenylalanine disrupt essential 
cellular processes in the brain such as myelination and protein 
synthesis, eventually producing severe intellectual disability. 
Most cases of hyperphenylalaninemia are caused by muta¬ 
tions of the gene that encodes phenylalanine hydroxylase 
(PAH), resulting in classic phenylketonuria (PKU). Hundreds 
of different mutations have been identified in PAH, including 
substitutions, insertions, and deletions. The prevalence of 
hyperphenylalaninemia varies widely among groups from 
different geographic regions; PKU ranges from 1 in every 
10,000 people of European origin to 1 in every 90,000 in those 
of African ancestry. Less commonly, hyperphenylalaninemia 
is caused by defects in the synthesis of tetrahydrobiopterin, a 


cofactor necessary for the hydroxylation of phenylalanine, or 
by a deficiency of dihydropteridine reductase. 

Treatment of most hyperphenylalaninemias is aimed at 
restoring normal blood phenylalanine levels by restricting 
dietary intake of phenylalanine-containing foods (Box 7-1). 
However, phenylalanine is an essential amino acid, and ade¬ 
quate supplies are necessary for normal growth and develop¬ 
ment. A complete lack of phenylalanine is fatal. Thus, a fine 
balance must be maintained between providing enough 
protein and phenylalanine for normal growth and preventing 
the serum phenylalanine level from rising too high. Persons 
with PKU clearly benefit from lifelong treatment. Thus, once 
PKU is diagnosed, the person must follow a phenylalanine- 
restricted diet for life. Sapropterin dihydrochloride, a form of 
tetrahydrobiopterin (BH4) can help BH-4 responsive PKU 
patients. 

Hyperphenylalaninemia in a pregnant woman results in 
elevated phenylalanine levels in the fetus. This can cause poor 
growth, birth defects, microcephaly, and intellectual disability 
in the fetus (regardless of the fetus’s genotype). Thus, it is 
important that women with PKU receive appropriate preg¬ 
nancy counseling. Optimally, they should maintain a low- 
phenylalanine diet at conception and throughout pregnancy. 

Hyperphenylalaninemias are the most common 
defects of amino acid metabolism. Classic 
PKU is caused by mutations in the gene that 
encodes phenylalanine hydroxylase. Hyper¬ 
phenylalaninemias are treated by restricting 
dietary intake of phenylalanine-containing 
foods. 

Tyrosine 

The amino acid tyrosine is the starting point of the synthetic 
pathways leading to catecholamines, thyroid hormones, and 
melanin pigments, and it is incorporated into many proteins. 
Elevated levels of serum tyrosine can be acquired (e.g., severe 
hepatocellular dysfunction), or they can result from an inborn 
error of catabolism, of which there are several. Hereditary 
tyrosinemia type 1 (HTl) is the most common metabolic 
defect of tyrosine and is caused by a deficiency of fumaryl- 
acetoacetate hydrolase (FAH), which catalyzes the last step in 
tyrosine catabolism (see Fig. 7-1). Accumulation of FAH’s 













CHAPTER 7 Biochemical Genetics: Disorders of Metabolism 


139 


BOX 7-1 Dietary Management of Inborn Errors of Metabolism 


The most important component of therapy for many inborn 
errors of metabolism is manipulation of the diet. This com¬ 
monly Includes restriction of substrates that are toxic, such as 
carbohydrates (e.g., In galactosemia, diabetes mellitus), fats 
(e.g., in MCAD deficiency), and amino acids (e.g., in PKU, 
MSUD, urea cycle defects); avoidance of fasting: replacement 
of deficient cofactors (e.g., B vitamins, carnitine); or using 
alternative pathways of catabolism to eliminate toxic sub¬ 
stances. Because many metabolic disorders are diagnosed in 
infants whose nutritional requirements change quickly (some¬ 
times weekly), it is imperative to provide infants with diets that 
provide adequate calories and nutrients for normal growth 
and development. Thus, the responsibility for maintaining a 
special diet begins with the parents of an affected child and 
shifts to the child when he or she is capable of managing 
independently. For most persons with metabolic diseases, 
a special diet must be maintained for life. This results In 
many unique challenges that are often unforeseen by families 
and care providers alike. Consequently, It Is Important that 
support and guidance be provided by clinical dietitians, gastro¬ 
enterologists, psychologists, genetic counselors, and biochem¬ 
ical geneticists. 

For example, newborns with PKU are fed a low-phenylalanine 
diet to prevent the effects of hyperphenylalaninemia on the 
brain. Breast milk contains too much phenylalanine to be used 
as the only source of nutrients. Therefore, many infants are 
fed an expensive low-phenylalanine formula that is available 
only by prescription, a medical food.* Small quantities of breast 
milk can be mixed with the formula, although the breast 
milk must be pumped and carefully titrated to avoid giving 
the infant too much phenylalanine. Serum phenylalanine levels 
are measured frequently, and adjustments must be made to 
the diet to compensate for an infant's growth and varying 
individual tolerances for phenylalanine. These interventions can 
disrupt mother-infant bonding and distort a family's social 
dynamics. 

As a child with PKU grows older, low-protein food substitutes 
are introduced to supplement the formula (e.g., low-protein 
breads and pasta). To put this in perspective, consider that a 
10-year-old child with PKU might Ingest 300 to 500 mg of 
phenylalanine per day. Thus, three or four slices of regular 
bread would meet a child's nutritional needs and dietary phe¬ 
nylalanine limit because of the relatively high protein content 
of grains. Low-protein foods make the diet more substantive 
and varied. Indeed, seven slices of low-protein bread contain 
the phenylalanine equivalent of one piece of regular bread. Yet, 
many of these foods have an odor, taste, texture, or appear¬ 
ance that differs distinctively from foods containing normal 
amounts of protein. 

The intake of many fruits, fats, and carbohydrates is less 
restricted (see table at the right). However, phenylalanine is 
found unexpectedly in many food items (e.g., gelatin, beer). In 
fact, the FDA requires manufacturers to label products contain¬ 
ing aspartame (a common artificial sweetener that contains 
phenylalanine) with a warning for persons with PKU. 

Teenagers with PKU sometimes have difficulty socializing 
with their peers because their dietary restrictions limit food 
choices at restaurants, sporting events, and parties. Adults 


with PKU must consume more medical food than they did 
during childhood due to their size and protein requirements. 
Women with PKU must be on a severely restricted low- 
phenylalanine diet during pregnancy because hyperphenylal¬ 
aninemia is a known teratogen (see text). 

Inborn errors of metabolism are chronic diseases that are 
often treated by substantially modifying the diet. This can 
require considerable changes in lifestyle, precipitating financial 
and emotional hardships of which health-care providers need 
to be aware. 


PKU CALORIE SOURCES 



□ Low-phenylalanine medical foods 

□ No-phenylalanine medical foods 

□ Natural foods 

Sources of calories of persons with phenylketonuria (PKU) 
at different ages. The amount of no-protein medical foods 
and low-protein medical foods eaten increases with age as 
the need for energy and protein increases. (Courtesy Kath¬ 
leen Huntington and Diane Waggoner, Oregon Health and 
Science University.) 

Phenylalanine Content of Some 
Common Foods 

PHENYLALANINE 


FOOD 

MEASURE 

(mg) 

Turkey, light meat 

1 cup 

1662 

Tuna, water-packed 

1 cup 

1534 

Baked beans 

1 cup 

726 

Lowfat milk (2% fat) 

1 cup 

393 

Soy milk 

1 ounce 

46 

Breast milk 

1 ounce 

14 

Broccoli (raw) 

3 tablespoons 

28 

Potato (baked) 

2 tablespoons 

14 

Watermelon 

1/2 cup 

12 

Grapefruit (fresh) 

1/4 fruit 

13 

Beer 

6 ounces 

11 

Gelatin dessert 

1/2 cup 

36 


*U.S. Public Law 100-290 defines a medical food as a food that is formulated to be consumed or administered enterally under the 
supervision of a physician and that is intended for the specific dietary management of a disease or condition for which distinctive nutritional 
requirements, based on recognized scientific principles, are established by medical evaluation. 
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substrate, fumarylacetoacetate, and its precursor, maleylace- 
toacetate, is thought to be mutagenic and toxic to the liver. 
Consequently, HTl is characterized by dysfunction of the 
renal tubules, acute episodes of peripheral neuropathy, pro¬ 
gressive liver disease leading to cirrhosis, and a high risk for 
developing liver cancer (hepatocellular carcinoma). 

Management of HTl includes supportive care, dietary 
restriction of phenylalanine and tyrosine, and admin¬ 
istration of 2-(nitro-4-trifluoro-methylbenzoyl)-l,3-cyclo- 
hexanedione (NTBC) or nitisinone, an inhibitor of an enzyme 
upstream of FAH (4-hydroxyphenylpyruvate dioxygenase). 
Use of NTBC, combined with a low-tyrosine diet, has pro¬ 
duced marked improvement in children with HTl. The long¬ 
term effects of NTBC are still unclear, but children who have 
been treated for more than 20 years appear to be doing well. 
Liver transplantation can be curative but is typically reserved 
for persons who fail to respond to NTBC or who develop a 
malignancy. In a mouse model of HTl, gene therapy (see 
Chapter 13) has been used to repopulate the liver with cells 
that exhibit stable long-term expression of FAH (FAH^ 
hepatocytes). Because some FAH-hepatocytes remain in 
these livers, it is unclear whether the risk of hepatocellular 
carcinoma is reduced. Gene therapy for HTl in humans 
might someday replace life-long dietary and pharmacologic 
treatment. 

Mutations in FAH have been identified in many families. 
A splice-site mutation is quite common in French-Canadians, 
and its high frequency is probably the result of founder effect 
(see Chapter 3). This mutation results in an in-frame deletion 
of an exon, which removes a series of critically important 
amino acids from FAH. Missense and nonsense mutations of 
PAH have also been found in persons with HTl. 

Tyrosinemia type 2 (oculocutaneous tyrosinemia) is 
caused by a deficiency of tyrosine aminotransferase. It is 
mainly characterized by corneal erosions and thickening of 
the skin on the palms and the soles. Tyrosinemia type 3 is 
associated with reduced activity of 4-hydroxyphenylpyruvate 
dioxygenase. Only a few affected persons have been reported. 

Deficiency of FAH causes HTl. Accumulation 
of the substrates of FAH leads to neurological, 
kidney, and liver dysfunction. Although liver 
transplantation has been the cornerstone of 
treatment for HTl, the use of drugs that block 
the production of FAH has proved effective. 

Branched-Chain Amino Acids 

Approximately 40% of the preformed amino acids required 
by mammals are branched-chain amino acids (BCAAs) such 
as valine, leucine, and isoleucine. BCAAs can be used as a 
source of energy through an oxidative pathway that uses an 
a-ketoacid as an intermediate. Decarboxylation of a-keto- 
acids is mediated by a multimeric enzyme complex called 
branched-chain a-ketoacid dehydrogenase (BCKAD). The 
BCKAD complex is composed of at least four catalytic com¬ 
ponents and two regulatory enzymes, which are encoded by 
six genes. A deficiency of any one of these six components 


produces a disorder known as maple syrup urine disease 
(MSUD), so named because the urine of affected persons has 
an odor reminiscent of maple syrup. 

The prevalence of MSUD in the general population is low, 
but MSUD is relatively common in the Mennonite commu¬ 
nity of Lancaster County, Pennsylvania, where approximately 
1 in every 7 persons is a heterozygous carrier. All of these 
carriers have the same disease-causing mutation of Eja, one 
of the loci encoding a catalytic component of BCKAD, and 
all are descendants of a couple who emigrated from Europe 
in the 18th century. This is another example of founder effect 
in a small, relatively isolated population (see Chapter 3). 

Untreated patients with MSUD accumulate BCAAs and 
their associated ketoacids, leading to progressive neurodegen¬ 
eration and death in the first few months of life. Treatment 
of MSUD consists of dietary restriction of BCAAs to the 
minimum required for normal growth. Despite treatment, 
episodic deterioration is common, and supportive care is 
required during these crises. Because increasing BCKAD 
activity by only a few percentage points can alter the course 
of disease substantially, therapy with thiamine, a cofactor of 
BCKAD, is used to treat these patients. Gene therapy for 
MSUD is also being investigated. 

Maple syrup urine disease is caused by defects 
in branched-chain a-ketoacid dehydrogenase. 
Accumulation of BCAAs causes progressive 
neurodegeneration and death. Treatment con¬ 
sists of restricting dietary intake of BCAAs to a 
minimal level. 

Early detection of amino acid defects, coupled with 
prompt intervention, can prevent severe physical impairment 
and death. Moderate increases of enzyme activity can dra¬ 
matically alter the course of some aminoacidopathies, making 
them good candidates for somatic cell gene therapy. 

Lipid Metabolism 

Lipids (Greek, lipos, “fat”) are a heterogeneous group of bio¬ 
molecules that are insoluble in water and highly soluble in 
organic solvents (e.g., chloroform). They provide the back¬ 
bone for phospholipids and sphingolipids, which are compo¬ 
nents of all biological membranes. Lipids such as cholesterol 
are also constituents of steroid hormones; they act as intracel¬ 
lular messengers and serve as an energy substrate. Elevated 
serum lipid levels (hyperlipidemia) are common and result 
from defective lipid transport mechanisms. Errors in the 
metabolism of fatty acids (hydrocarbon chains with a termi¬ 
nal carboxylate group) are much less common. However, 
characterizing hyperlipidemias (see Chapter 12), errors of 
fatty acid metabolism, and defects of cholesterol production 
and use has been a powerful approach for understanding the 
biochemical basis of lipid catabolism. 

During fasting and prolonged aerobic exercise, fatty acids 
are mobilized from adipose tissue and become a major sub¬ 
strate for energy production in the liver, skeletal muscle, and 
cardiac muscle. Major steps in this pathway include the 
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FIG 7-4 Summary of fatty acid metabolism: (1) fatty acid 
entry into a cell, (2) activation and transesterification, (3) mito¬ 
chondrial uptake, (4) oxidation via |3-oxidation, (5) formation 
of ketone bodies. Note that medium-chain fatty acids can 
traverse the mitochondrial membrane without carnitine- 
mediated transport. CoA, Coenzyme A. 

uptake and activation of fatty acids by cells, transport across 
the outer and inner mitochondrial membranes, and entry 
into the (J-oxidation spiral in the mitochondrial matrix (Fig. 
7-4). Defects in each of these steps have been described in 
humans, although defects of fatty acid oxidation (FAO) are 
the most common. 

Fatty Acids 

The most common inborn error of fatty acid metabolism 
results from a deficiency of medium-chain acyl-coenzyme A 
dehydrogenase (MCAD). MCAD deficiency is characterized 
by episodic hypoglycemia, which is often provoked by fasting 
(see Clinical Commentary 7-1). Commonly, a child with 
MCAD deficiency presents with vomiting and lethargy after 
a period of diminished oral intake due to a minor illness (e.g., 
upper respiratory illness, gastroenteritis). Fasting results in 
the accumulation of fatty acid intermediates, a failure to 
produce ketones in sufficient quantities to meet tissue 
demands, and the exhaustion of glucose supplies. Cerebral 
edema and encephalopathy result from the indirect and direct 
effects of fatty acid intermediates in the central nervous 
system. Death often follows unless a usable energy source 
such as glucose is provided promptly. Between these episodes, 
children with MCAD deficiency often have normal examina¬ 
tions. Treatment consists of the avoidance of fasting, ensuring 
an adequate source of calories, and providing supportive care 
during periods of nutritional stress. 


To date, most of the reported MCAD patients are of north¬ 
western European origin, and the majority have an A-to-G 
missense mutation, c.985A>G, that results in the substitution 
of glutamate for lysine. Additional substitution, insertion, 
and deletion mutations have been identified but are much 
less common. The molecular characterization of MCAD has 
made it possible to offer direct DNA testing as a reliable and 
inexpensive diagnostic tool. Furthermore, because MCAD 
deficiency meets the criteria established for newborn screen¬ 
ing (see Chapter 13), testing for it has been added to newborn 
screening programs in all fifty states of the United States and 
elsewhere. 

Long-chain acyl-CoA fatty acids are metabolized in a 
sequence of steps catalyzed by a handful of different enzymes. 
The first step is controlled by long-chain acyl-CoA dehydro¬ 
genase (LCAD). The second step is catalyzed by enzymes that 
are part of an enzyme complex called the mitochondrial tri¬ 
functional protein (TFP). One of the enzymes of the TFP is 
long-chain L-3-hydroxyacyl-CoA dehydrogenase (LCFiAD). 

TCHAD deficiency is one of the most severe of the FAO 
disorders. The first cases reported manifested with severe liver 
disease ranging from fulminant neonatal liver failure to a 
chronic, progressive destruction of the liver. Over the last 10 
years, the phenotype has expanded to include cardiomyopa¬ 
thy, skeletal myopathy, retinal disease, peripheral neuropathy, 
and sudden death. Its clinical and biochemical characteristics 
clearly differentiate it from other FAO disorders. Treatment 
consists of avoidance of fasting, a low-fat diet, supplementa¬ 
tion with medium-chain triglycerides, and carnitine supple¬ 
mentation in some cases. 

In the last decade, a number of women pregnant with a 
fetus affected with TCHAD deficiency have developed a severe 
liver disease called acute fatty liver of pregnancy (AFLP) and 
HELLP syndrome (hemolysis, elevated liver function tests, 
tow platelets). It is hypothesized that failure of the fetus to 
metabolize free fatty acids results in the accumulation of 
abnormal fatty acid metabolites in the maternal liver and 
placenta. Accumulation in the liver might cause the abnor¬ 
malities observed in women with AFLP and HELLP. Accumu¬ 
lation in the placenta might cause intrauterine growth 
retardation and increase the probability of preterm delivery, 
both of which are common in children with TCHAD 
deficiency. 

Cholesterol 

Elevated levels of plasma cholesterol have been associated 
with various conditions, most notably atherosclerotic heart 
disease. It has been demonstrated that substantially reduced 
levels of cholesterol can adversely affect growth and develop¬ 
ment. The final step of cholesterol biosynthesis is catalyzed 
by the enzyme A7-sterol reductase (DHCR7). For years it 
has been noted that persons with an autosomal recessive 
disorder called Smith-Lemli-Opitz (SLO) syndrome have 
reduced levels of cholesterol and increased levels of 
7-dehydrocholesterol (a precursor of DHCR7). SLO is char¬ 
acterized by various congenital anomalies of the brain, heart, 
genitalia, and hands (Fig. 7-5). It is unusual in this respect 
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FIG 7-5 A child with Smith-Lemli-Opitz syndrome. Note the 
broad nasal root, upturned nasal tip, and inner epicanthal 
folds that are characteristic of this disorder. (From Jones K. 
Smith's Recognizable Patterns of Human Maiformation. 6th 
ed. Philadelphia: Saunders; 2006:116.) 


because most inborn errors of metabolism do not cause con¬ 
genital malformations. 

In 1998, SLO was discovered to be caused by mutations in 
the DHCR7 gene, and to date more than 100 different muta¬ 
tions in DHCR7 have been found. Most of these are missense 
mutations that result in substitutions of a highly conserved 
residue of the protein. Population screens for mutant DHCR7 
alleles suggest that the carrier frequency in populations of 
European ancestry is 3% to 4%. This high frequency suggests 
the incidence of SLO should be much higher than is com¬ 
monly observed. One explanation is that some pregnancies 
with affected fetuses result in miscarriages or that SLO is 
undetected in some mildly affected patients. Supplementing 
the diet of SLO children with cholesterol can ameliorate their 
growth and feeding problems, although its effect on cognitive 
development is less clear. 

The clinical presentation of children with 
defects of lipid metabolism varies from slow 
deterioration to sudden death. MCAD defi¬ 
ciency is the most common of these disorders. 
Most affected persons can be diagnosed by 
biochemical analysis of a drop of dried blood 
at birth. 

Steroid Hormones 

Cholesterol is the precursor for several major classes of 
steroid hormones including glucocorticoids (e.g., cortisol), 
mineralocorticoids (e.g., aldosterone), androgens (e.g., tes¬ 
tosterone), and estrogens (Fig. 7-6). The actions of these 
steroid hormones are typically mediated by binding to an 


intracellular receptor, and these receptor-ligand complexes 
have myriad effects on a wide range of physiological pro¬ 
cesses. Defects in the synthesis of steroid hormones or their 
receptors can cause a wide variety of clinical abnormalities. 

Congenital adrenal hyperplasia (CAH) consists of a group 
of genetically heterogeneous autosomal recessive disorders of 
cortisol biosynthesis. Approximately 95% of cases of CAH are 
caused by mutations in CYP21A2, the gene that encodes 
21-hydroxylase, and are characterized by cortisol deficiency, 
variable deficiency of aldosterone, and an excess of andro¬ 
gens. The overall incidence of 21-hydroxylase deficiency is 
about 1 in 15,000; therefore, the carrier frequency is about 1 
in every 60 persons. However, the incidence of CAH varies 
widely among different ethnic groups. For example, among 
the Yupik of Alaska the incidence is 1 in 280. 

The clinical severity of CAH varies widely and depends on 
the extent of residual 21-hydroxylase activity. The severe or 
classic form is typified by overproduction of cortisol precur¬ 
sors and thus excess adrenal androgens. In addition, aldoste¬ 
rone deficiency leads to loss of salt. In the milder forms, 
sufficient cortisol is produced, but there is still an overpro¬ 
duction of androgens. Female infants with CAH typically 
have ambiguous genitalia (Fig. 7-7) at birth due to in utero 
exposure to high concentrations of androgens, and CAH is 
the most common cause of ambiguous genitalia in 46,XX 
infants. Boys with CAH have normal genitalia at birth, so the 
age of diagnosis depends on the severity of aldosterone defi¬ 
ciency. Most boys have the salt-losing form of CAH and typi¬ 
cally present between 7 and 14 days of age in adrenal crisis 
manifested as weight loss, lethargy, dehydration, hyponatre¬ 
mia (decreased plasma Na^ concentration), and hyperkalemia 
(increased plasma concentration). If CAH is left untreated, 
death soon follows. Adrenal crisis is less common in girls 
because their ambiguous genitalia typically leads to early 
diagnosis and intervention. 

Boys who do not have a salt-losing form of CAH present 
at 2 to 4 years of age with premature virilization. Persons with 
nonclassic or mild CAH do not have cortisol deficiency but 
manifest symptoms in late childhood or adulthood caused by 
increased androgen levels including premature pubertal 
development, hirsutism (increased hair growth in women in 
areas where hair is usually thin), amenorrhea or oligomenor¬ 
rhea, polycystic ovaries, or acne. 

Treatment of CAH consists of replacing cortisol, suppress¬ 
ing adrenal androgen secretion, and providing mineralocor¬ 
ticoids to return electrolyte concentrations to normal. The 
surgical management of children born with ambiguous geni¬ 
talia is complex and somewhat controversial, but most girls 
with CAH identify as female, and feminizing surgery during 
the first year of life is standard. In pregnancies in which the 
fetus is at risk for classic CAH, steroids are administered to 
the mother to suppress the fetal overproduction of androgens 
and reduce the incidence of ambiguous genitalia in affected 
female infants. 

CYP21A2 is located on chromosome 6p21 within the 
major histocompatibility complex (MHC; see Chapter 9). 
More than 75% of mutant CYP21A2 alleles are caused by 
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FIG 7-6 Summary of steroid synthesis. Enzymes involved in the production of cortisol, aldoste¬ 
rone, and testosterone are indicated. (Modified from Turnpenny P. Emery's Elements of Medical 
Genetics. 13th ed. Philadelphia: Churchill Livingstone: 2007.) 


gene conversion* in which deleterious mutations are trans¬ 
ferred to CYP1A2. These mutations result in a protein product 
that lacks normal 21-hydroxylase activity. 

Congenital adrenal hyperplasia is a relatively 
common defect of cortisol synthesis that 
causes masculinization of the genitalia in 
females and premature virilization in males. It 
is commonly caused by diminished activity of 
21-hydroxylase. 

Steroid Hormone Receptors 

Defects of most steroid hormone receptors are rare. For 
example, defects of the estrogen receptor have been found in 
a small number of persons in whom failed epiphyseal closure 


*Gene conversion is a process in which two different DNA segments 
recombine in such a way that one segment is altered to become 
identical to the other. 


results in tall stature. Mutations in the gene that encodes the 
glucocorticoid receptor can produce hereditary resistance to 
the actions of cortisol. 

In contrast, mutations in the X-linked gene that encodes 
the androgen receptor (AR) are relatively common. These 
mutations commonly result in complete or partial androgen 
insensitivity syndromes (CAIS or PAIS) in 46,XY persons. 
CAIS was formerly called “testicular feminization syndrome” 
and is characterized by typical female external genitalia at 
birth, absent or rudimentary miillerian structures (i.e., fal¬ 
lopian tubes, uterus, and cervix), a short vaginal vault, ingui¬ 
nal or labial testis, and reduced or absent secondary sexual 
characteristics. Persons with PAIS can have ambiguous exter¬ 
nal genitalia, varied positioning of the testes, and absent or 
present secondary sexual characteristics. Nearly all affected 
persons are infertile. 

CAIS and PAIS are transmitted in an X-linked recessive 
pattern. More than 95% of persons with CAIS have muta¬ 
tions in the AR gene, and to date, hundreds of different muta¬ 
tions have been found. Most of these mutations are predicted 
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FIG 7-7 A 46,XX infant with congenital adrenal hyperplasia. 
The external genitalia are virilized, with rogation and partial 
fusion of the labial folds. No gonads were palpable, and a 
uterus and ovaries were visualized by ultrasonography. (Cour¬ 
tesy Melissa Parisi, University of Washington.) 

to impair androgen binding or binding of the androgen 
receptor to DNA. Expansion of a polyglutamine tract in the 
androgen receptor causes a completely different genetic dis¬ 
order called spinal bulbar muscular atrophy (see Chapter 5). 

Peroxisomal Enzymes 

Peroxisomes are organelles that contain more than 50 
enzymes that catalyze synthetic and degradative metabolic 
functions related mainly to lipid metabolism. Disorders of 
peroxisomes are typically divided into two groups: peroxi¬ 
some biogenesis disorders (PBDs) and the single peroxisomal 
enzyme deficiencies (PEDs). 

The PBD group comprises four disorders: Zellweger 
syndrome, neonatal adrenoleukodystrophy, infantile Refsum 
disease, and rhizomelic chondrodysplasia punctata type 1. 
Zellweger syndrome is the most severe of these disorders and 
manifests in newborns as severe hypotonia, progressive disease 
of the white matter of the brain, a distinctive facial appearance, 
and typically death in infancy. Children with neonatal adreno¬ 
leukodystrophy have similar but less-severe symptoms along 
with seizures. Infantile Refsum disease is less severe than Zell¬ 
weger syndrome and neonatal adrenoleukodystrophy, yet 
affected children still have developmental delay, learning dis¬ 
abilities, hearing loss, and visual impairment. 

PBDs are caused by mutations in PEX genes that encode 
peroxins. These proteins are necessary both for peroxisome 
biogenesis and for importing proteins of the peroxisomal 
matrix and membrane. To date, mutations in more than a 
dozen different peroxin-encoding genes have been discov¬ 
ered, and for many of these genes, mutations can cause Zell¬ 
weger syndrome, neonatal adrenoleukodystrophy, or infantile 
Refsum disease. 


Nearly a dozen different PEDs have been described to date, 
and persons with these enzyme defects have widely disparate 
clinical characteristics depending on which peroxisomal 
function is primarily impaired. One of the most common is 
X-linked adrenoleukodystrophy (ALD), which involves faulty 
(J-oxidation of very long chain fatty acids (VLCFA). ALD is 
subdivided into several disorders, depending in part on age 
of onset. The most common of these are childhood cerebral 
ALD or CCALD and adrenomyeloneuropathy (AMN). 
CCALD typically manifests between the ages of 3 and 10 years 
with progressive cognitive and behavioral deterioration that 
leads to profound disability. AMN causes similar but less 
severe neurological symptoms but has a much later age of 
onset and slower rate of progression. Unlike many X-linked 
recessive disorders, 40% to 50% of women who are hetero¬ 
zygous for ALD develop AMN-like symptoms. 

Peroxisomal defects cause a wide variety of 
neurological problems in children and adults. 
These range from relatively mild and slowly 
progressive to severe and life threatening. 

Degradative Pathways 

Most biomolecules are dynamic, perpetually being recycled 
as part of the normal metabolic state of a cell. Existing mol¬ 
ecules are degraded into their constituents to produce sub¬ 
strates for building new molecules. The byproducts of energy 
production, substrate conversions, and anabolism also need 
to be processed and eliminated. Errors in these degradative 
pathways result in the accumulation of metabolites that 
would otherwise have been recycled or eliminated. 

Lysosomal Storage Disorders 

The lysosomal storage disorders are the prototypical inborn 
errors of metabolism: Disease results from the accumulation 
of substrate. Enzymes within lysosomes catalyze the stepwise 
degradation of sphingolipids, glycosaminoglycans (muco¬ 
polysaccharides), glycoproteins, and glycolipids. Accumula¬ 
tion (storage) of undegraded molecules results in cell, tissue, 
and organ dysfunction. Most of the lysosomal disorders are 
caused by enzyme deficiencies, although some are caused by 
the inability to activate an enzyme or to transport an enzyme 
to a subcellular compartment where it can function properly. 
Many of the lysosomal storage disorders are found with 
uncommonly high prevalence in various ethnic populations 
as a result of founder effect, genetic drift, and possibly natural 
selection (see Chapter 3). 

Mucopolysaccharidoses 

The mucopolysaccharidoses (MPS disorders) are a heteroge¬ 
neous group of conditions caused by a reduced ability to 
degrade one or more glycosaminoglycans (e.g., dermatan 
sulfate, heparan sulfate, keratan sulfate, chondroitin sulfate). 
These glycosaminoglycans are degradation products of pro¬ 
teoglycans found in the extracellular matrix. Ten different 
enzyme deficiencies cause six different MPS disorders, which 
share many clinical features (Table 7-3), but these disorders 







CHAPTER 7 Biochemical Genetics: Disorders of Metabolism 


145 


TABLE 7-3 Mucopolysaccharidoses 


NAME 

MUTANT ENZYME 

CLINICAL FEATURES 

Hurler-Scheie (MPS-I) 

a-L-lduronidase 

Coarse face, hepatosplenomegaly, corneal clouding, dysostosis 
multiplex, intellectual disability 

Hunter (MPS-II) 

Iduronate sulfatase 

Coarse face, hepatosplenomegaly, dysostosis multiplex, 
intellectual disability, behavioral problems 

Sanfilippo A MPS-IIIA 

Heparan-AZ-sulfamidase 

Behavioral problems, dysostosis multiplex, intellectual disability 

Sanfilippo B MPS-IIIB 

a-A/-Acetylglucosaminidase 

Behavioral problems, dysostosis multiplex, intellectual disability 

Sanfilippo C MPS-IIIC 

Acetyl-CoA:a-glucosaminide 

A/-acetyltransferase 

Behavioral problems, dysostosis multiplex, intellectual disability 

Sanfilippo D MPS-IIID 

A/-Acetylglucosamine-6-sulfatase 

Behavioral problems, dysostosis multiplex, intellectual disability 

Morquio A MPS-IVA 

A/-Acetylglucosamine-6-sulfatase 

Short stature, bony dysplasia, hearing loss 

Morquio B MPS-IVB 

P-Galactosidase 

Short stature, bony dysplasia, hearing loss 

Maroteaux-Lamy MPS-VI 

Aryl sulfatase B 

Short stature, corneal clouding, cardiac valvular disease, 
dysostosis multiplex 

Sly MPS-VI 1 

P-Glucuronidase 

Coarse face, hepatosplenomegaly, corneal clouding, dysostosis 
multiplex 


*Hunter syndrome is an X-linked recessive disorder; the remaining MPS disorders are autosomal recessive. 

"^Dysostosis multiplex is a distinctive pattern of changes in bone, including a thickened skull, anterior thickening of the ribs, vertebral 
abnormalities, and shortened and thickened long bones. 


can be distinguished from each other by clinical, biochemical, 
and molecular analyses. Assays that measure enzyme activity 
in fibroblasts, leukocytes, serum, or dried blood spots are 
available for each MPS disorder, and prenatal testing after 
amniocentesis or chorionic villus sampling (see Chapter 13) 
is possible. Except for the X-linked Hunter syndrome, all of 
the MPS disorders are inherited in an autosomal recessive 
fashion. 

All MPS disorders are characterized by chronic and 
progressive multisystem deterioration, which causes hearing, 
vision, joint, and cardiovascular dysfunction (Fig. 7-8). 
Hurler, severe Hunter, and Sanfilippo syndromes are charac¬ 
terized by intellectual disability, and normal cognition is 
observed in other MPS disorders. 

Deficiency of iduronidase (MPS 1) is the prototypic MPS 
disorder. It produces a spectrum of phenotypes that have 
been traditionally delimited into three groups—Hurler, 
Hurler-Scheie, and Scheie syndromes—which manifest 
severe, moderate, and mild disease, respectively. MPS 1 disor¬ 
ders cannot be distinguished from each other by measuring 
enzyme activity; therefore, the MPS 1 phenotype is usually 
assigned on the basis of clinical criteria. The iduronidase gene 
has been cloned, and eventually genotype-phenotype corre¬ 
lations may lead to earlier and more accurate classification. 

Hunter syndrome (MPS II) is caused by a deficiency of 
iduronate sulfatase. It is categorized into mild and severe 
phenotypes based on clinical assessment. The onset of disease 
usually occurs between 2 and 4 years of age. Affected children 
develop coarse facial features, short stature, skeletal deformi¬ 
ties, joint stiffness, and intellectual disability. The gene that 
encodes iduronate sulfatase is composed of 9 exons spanning 
24 kb. Twenty percent of all identified mutations are large 
deletions, and most of the remainder are missense and non¬ 
sense mutations. 


Symptomatic treatment has been the standard of care 
for MPS for many years. More recently, restoration of endog¬ 
enous enzyme activity has been accomplished by either bone 
marrow transplantation (BMT) or enzyme replacement with 
recombinant enzyme. BMT has become the mainstay of 
treatment for persons with Hurler syndrome and has been 
shown to improve the coarse facial features, upper airway 
obstruction, and cardiac disease. It also appears to mitigate 
neurological deterioration, although the long-term outcomes 
are still being investigated. BMT has been less successful for 
other MPS disorders. BMT is, in general, also associated with 
substantial morbidity and mortality. Enzyme replacement for 
Hurler syndrome was approved by the U.S. Food and Drug 
Administration in 2003 and has been shown to improve hep- 
atosplenomegaly and respiratory disease. Early studies of 
enzyme replacement in Hunter syndrome (MPS II) and 
Maroteaux-Lamy syndrome (MPS VI) are promising. Enzyme 
replacement for Morquio A syndrome was recently approved 
by the FDA. 

Defects of glycosaminoglycan degradation 
cause a heterogeneous group of disorders 
called mucopolysaccharidoses (MPS disor¬ 
der). All MPS disorders are characterized by 
progressive multisystem deterioration causing 
hearing, vision, joint, and cardiovascular dys¬ 
function. These disorders can be distinguished 
from each other by clinical, biochemical, and 
molecular studies. 

Sphingolipidoses (Lipid Storage Diseases) 

Defects in the degradation of sphingolipids (sphingolipido¬ 
ses) result in their gradual accumulation, which leads to mul¬ 
tiorgan dysfunction (Table 7-4). Deficiency of the lysosomal 
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FIG 7-8 A, A boy with a mutation in a-L-iduronidase, which causes Hurler syndrome. Note his 
coarse facial features, crouched stance, thickened digits, and protuberant abdomen. B, Trans¬ 
genic mice with a targeted disruption of a-L-iduronidase. Progressive coarsening of the face is 
apparent as 8-week-old mice (left) grow to become 52-week-old mice (right). (Courtesy Dr. Lome 
Clarke, University of British Columbia.) 


TABLE 7-4 Lysosomal Storage Disorders’^ 


NAME 

MUTANT ENZYME 

CLINICAL FEATURES 

Tay-Sachs 

P-Hexosaminidase A 

Hypotonia, spasticity, seizures, blindness 

Gaucher (type 1; nonneuropathic) 

P-Glucosidase 

Splenomegaly, hepatomegaly, bone marrow infiltration, brain 
usually spared 

Niemann-Pick, type lA 

Sphingomyelinase 

Hepatomegaly, corneal opacities, brain deterioration 

Fabry 

a-Galactosidase 

Paresthesia of the hands and feet, corneal dystrophy, 
hypertension, renal failure, cardiomyopathy 

Gmi gangliosidosis (infantile) 

P-Galactosidase 

Organomegaly, dysostosis multiplex,'^ cardiac failure 

Krabbe 

galactosylceramidase 

Hypertonicity, blindness, deafness, seizures, 
(galactosylceramide-specific) atrophy of the brain 

Metachromatic leukodystrophy 

Aryl sulfatase A 

Ataxia, weakness, blindness, brain atrophy (late-infantile) 

Sandhoff 

P-Hexosaminidase (total) 

Optic atrophy, spasticity, seizures 

Schindler 

a-A/-Acetylgalactosaminidase 

Seizures, optic atrophy, retardation 

Multiple sulfatase deficiency 

Aryl sulfatase A, B, C 

Retardation, coarse facial features, weakness, 
hepatosplenomegaly, dysostosis multiplex 


*Of the lysosomal storage disorders included in this table, Fabry syndrome is X-linked recessive and the remainder are autosomal recessive. 
^Dysostosis multiplex is a distinctive pattern of changes in bone, including a thickened skull, anterior thickening of the ribs, vertebral 
abnormalities, and shortened and thickened long bones. 


enzyme glucosylceramidase (also known as glucocerebrosi- 
dase or (J-glucosidase), causes an accumulation of glucosyl- 
ceramide that results in Gaucher disease. This condition is 
characterized by visceromegaly (enlarged visceral organs), 
multiorgan failure, and debilitating skeletal disease. Gaucher 


disease has traditionally been divided into three subtypes, 
which can be distinguished by their clinical features. Type 1 
is most common and does not involve the central nervous 
system. Type 2 is the most severe, often leading to death 
within the first 2 years of life. Type 3 Gaucher disease is 
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intermediate between the other two forms. In practice, the 
clinical phenotypes overlap, and the spectrum of Gaucher is 
so broad that it ranges from death in utero to persons who 
remain asymptomatic even in old age. 

The extent to which specific organs are affected by Gaucher 
disease determines a person s clinical course. Splenomegaly, 
hepatomegaly, and pulmonary disease are shared among all 
three clinical types of Gaucher disease. Splenomegaly is asso¬ 
ciated with anemia, leukopenia, and thrombocytopenia, and 
splenic infarction can cause abdominal pain. Hepatomegaly 
can cause liver dysfunction, but cirrhosis and hepatic failure 
are uncommon. 

Gaucher disease is caused by more than 400 different 
mutations in GBA, the gene that encodes glucosylceramidase. 
The frequency of alleles that cause type 1 Gaucher disease is 
particularly high in Ashkenazi Jews, in whom the five most 
common alleles account for 97% of all mutations. Persons 
with the same genotype can have very different clinical out¬ 
comes. However, persons with at least one p.N370S allele, the 
most common allele, do not develop primary neurological 
disease and tend to have a milder outcome in general. 

Traditionally, treatment of persons with Gaucher disease 
was largely supportive (e.g., splenectomy for hypersplenism, 
blood transfusions for anemia). However, enzyme replace¬ 
ment can improve symptoms resulting from spleen, liver, and 
bone involvement in type 1 Gaucher disease. The effective¬ 
ness of enzyme replacement to treat neurological symptoms 
remains limited as the enzyme cannot pass through the 
blood-brain barrier. Some persons with severe involvement, 
particularly chronic neurological conditions, benefit from 
BMT. 

Enzymes that function in lysosomes are targeted and 
transported into the lysosomal space by specific pathways. 
Targeting is mediated by receptors that bind mannose-6- 


phosphate recognition markers attached to the enzyme (i.e., 
a posttranslational modification). The synthesis of these rec¬ 
ognition markers is deficient in I-cell disease (mucolipidosis 
II), so named because the cytoplasm of fibroblasts from 
affected persons was found by light microscopy to contain 
inclusions. These inclusions represent partially degraded oli¬ 
gosaccharides, lipids, and glycosaminoglycans. As a conse¬ 
quence of recognition marker deficiency, newly synthesized 
lysosomal enzymes are secreted into the extracellular space 
instead of being correctly targeted to lysosomes. Persons with 
I-cell disease have coarse facial features, skeletal abnormali¬ 
ties, hepatomegaly, corneal opacities, intellectual disability, 
and early death. There is no specific treatment for I-cell 
disease. 

Many different lysosomal enzymes catalyze 
the degradation of sphingolipids, glycosami¬ 
noglycans, glycoproteins, and glycolipids. 
Deficiency of a lysosomal enzyme causes 
accumulation of substrate, visceromegaly, 
organ dysfunction, and early death if untreat¬ 
ed. Genes encoding many of the lysosomal 
enzymes have been cloned, and treatment 
with bone marrow transplantation and enzyme 
replacement is effective for some conditions. 

Urea Cycle Disorders 

The primary role of the urea cycle is to prevent the accumula¬ 
tion of nitrogenous wastes by incorporating nitrogen into 
urea, which is subsequently excreted by the kidney. Addition¬ 
ally, the urea cycle is responsible for the de novo synthesis of 
arginine. The urea cycle consists of five major biochemical 
reactions (Fig. 7-9); defects in each of these steps have been 
described in humans. 




FIG 7-9 Schematic diagram of the urea cycle. AS, Argininosuccinase; ASA, argininosuccinic acid 
synthetase: CoA, coenzyme A; CPS, carbamyl phosphate synthetase: NAGS, A/-acetylglutamate 
synthetase: OTC, ornithine transcarbamylase. 
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Deficiencies of carbamyl phosphate synthetase (CPS), 
ornithine transcarbamylase (OTC), argininosuccinic acid 
synthetase (ASA), and argininosuccinase (AS) result in the 
accumulation of urea precursors such as ammonium and 
glutamine. Consequently, the clinical presentations of persons 
with CPS, OTC, ASA, and AS deficiencies are similar, produc¬ 
ing progressive lethargy and coma and closely resembling the 
clinical presentation of Reye syndrome. Affected persons 
present in the neonatal period or at any time thereafter, and 
there is wide interfamilial variability in severity. In contrast, 
arginase deficiency causes a progressive spastic quadriplegia 
and intellectual disability. Differential diagnosis of these dis¬ 
orders is based on biochemical testing. 

Each of these disorders, except OTC deficiency, is inherited 
in an autosomal recessive pattern. Although OTC deficiency 
is an X-linked disorder, women can be symptomatic carriers 
depending, in part, on the fraction of hepatocytes in which 
the normal aUele is inactivated. The goal of therapy for each 
disorder is to prevent hyperammonemia by providing suffi¬ 
cient calories with a low-protein diet to maintain normal 
growth and development. Affected persons often require 
ammonia scavenging drugs such as glycerol phenylbutyrate, 
sodium phenylbutyrate, or sodium benzoate as well as sup¬ 
plementation with citrulline or arginine. Acute hyperammo¬ 
nemia is treated with hemodialysis and intravenous sodium 
phenylacetate and sodium benzoate. 

OTC deficiency, an X-linked condition, is the most preva¬ 
lent of the urea cycle disorders and thus has been intensively 
studied. A variety of exon deletions and missense mutations 
have been described, and mutations that affect RNA process¬ 
ing have been observed. 

The urea cycle consists of five major bio¬ 
chemical reactions that convert nitrogenous 
waste products to urea, which is subsequently 
excreted by the kidney. Enzymatic defects in 
this pathway lead to the accumulation of urea 
precursors, progressive neurological impair¬ 
ment, and death if untreated. The genes that 
cause most of these disorders have been 
cloned, including the most common observed 
defect, X-linked ornithine transcarbamylase 
(OTC) deficiency. 

Energy Production 

Energy for cellular activities can be produced from many dif¬ 
ferent substrates, including glucose, ketones, amino acids, 
and fatty acids. Catabolism of these substrates requires their 
stepwise cleavage into smaller molecules (via processes such 
as the citric acid cycle or (J-oxidation), followed by passage 
of hydrogen ions through the oxidative phosphorylation 
(OXPHOS) system. Alternatively, some substrates are pro¬ 
cessed anaerobically. 

The OXPHOS system consists of five multiprotein com¬ 
plexes that transfer electrons to O 2 . These complexes are com¬ 
posed of more than 100 polypeptides and are located in the 


inner mitochondrial membrane. Thirteen of these polypep¬ 
tides are encoded by the mitochondrial genome (see Fig. 
5-14), and the remainder are encoded by nuclear genes. Thus, 
assembly and function of the OXPHOS system require 
ongoing signaling and transport between the nucleus and the 
mitochondrion. OXPHOS regulation is mediated by a wide 
variety of factors, including O 2 supply, hormone levels, and 
metabolite-induced transcription control. 

More than 20 clinical disorders characterized by OXPHOS 
defects are caused by substitutions, insertions, or deletions in 
the mitochondrial genome and are maternally inherited (see 
Chapter 5). In addition, nuclear genes that can cause mito¬ 
chondrial DNA (mtDNA) deletions or depletion of mtDNA 
have been isolated, and these disorders are inherited in an 
autosomal recessive pattern. Mutations in genes affecting the 
OXPHOS system produce very complex phenotypes as a con¬ 
sequence of the varying metabolic requirements of different 
tissues and systems at different developmental stages. 

Electron transfer flavoprotein (ETF) and ETF-ubiquinone 
oxidoreductase (ETF-QO) are nuclear-encoded proteins 
through which electrons can enter the OXPHOS system. 
Inherited defects in either of these proteins cause glutaric 
acidemia type II, which is characterized by hypotonia, hepa¬ 
tomegaly, hypoketotic or nonketotic hypoglycemia, and 
metabolic acidemia. Most affected persons present in the 
neonatal period or shortly thereafter, and despite aggressive 
therapy, affected children often die within months. 

In most tissues, the metabolism of pyruvate proceeds 
through pyruvate dehydrogenase, the citric acid cycle, and 
the OXPHOS system. However, in tissues with high glycolytic 
activity and a reduced or absent OXPHOS capacity, the 
end products of metabolism are pyruvate and lactic acid (see 
Fig. 7-3). Lactate is produced by the reduction of pyruvate, 
and the bulk of circulating lactate is usually absorbed by the 
liver and converted to glucose. Defects in the pathways of 
pyruvate metabolism produce lactic acidemia. The most 
common of such disorders is a deficiency of the pyruvate 
dehydrogenase (PDH) complex. It may be caused by muta¬ 
tions in the genes encoding one of five components of the 
PDH complex: Ei, E 2 , E 3 , X-lipoate, or PDH phosphatase. 
These disorders are characterized by varying degrees of lactic 
acidemia, developmental delay, and abnormalities of the 
central nervous system. It has been suggested that the facial 
features of some children with PDH deficiency resemble 
those of children with fetal alcohol syndrome (see Clinical 
Commentary 15-5 in Chapter 15). It has also been proposed 
that acetaldehyde from the circulation of mothers with alco¬ 
holism inhibits PDH in the fetus, creating a phenocopy of 
PDH deficiency. 

The phenotype produced by defects of energy 
metabolism is complex because of the varying 
oxidative demands of different tissues and 
organs. Once the condition is diagnosed, the 
goal of treatment is to use alternative path¬ 
ways of energy production. 
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Transport Systems 

The efficient movement of molecules between compartments 
(e.g., organelles, cells, the environment), and hence across a 
barrier, often requires a macromolecule that connects the 
compartments and mediates transport through the barrier. 
Abnormalities of these transport systems have myriad 
effects, depending on whether altered barrier integrity or the 
accumulation of substrate has a greater impact on normal 
physiology. 

Cystine 

Cystine is the disulfide derivative of the amino acid cysteine. 
Abnormal cystine transport can produce two diseases: cystin- 
uria and cystinosis. Both disorders are inherited in an auto¬ 
somal recessive fashion. 

Abnormal cystine transport between cells and the extra¬ 
cellular environment causes cystinuria, one of the most 
common inherited disorders of metabolism. Although cys¬ 
tinuria produces substantial morbidity, early death is uncom¬ 
mon. Cystinuria is a genetically heterogeneous disorder 
caused by a defect of dibasic amino acid transport affecting 
the epithelial cells of the gastrointestinal tract and renal 
tubules. As a result, cystine, lysine, arginine, and ornithine are 
excreted in urine in quantities higher than normal. Cystine is 
the most insoluble of the amino acids; therefore, elevated 
urinary cystine predisposes to the formation of renal calculi 
(kidney stones). Complications of chronic nephrolithiasis 
(presence of kidney stones) include infection, hypertension, 
and renal failure. Treatment of cystinuria consists largely of 
rendering cystine more soluble. This is accomplished by 
administering pharmacological amounts of water (4-6 L/ 
day), alkalinizing the urine, and using chelating agents such 
as penicillamine. 

Based on amino acid excretion studies, cystinuria has 
been divided into three phenotypes. Type 1 cystinuria has 
been associated with missense, nonsense, and deletion 
mutations in a gene termed soluble carrier family 3, member 
1 amino acid transporter (SLC3A1). Types 11 and III cystin¬ 
uria are caused by mutations in a gene called SLC7A9. 
SLC3A1 and SLC7A9 encode the heavy and light subunits 
of the amino acid transporter b°’^ located on the brush- 
border plasma membrane of epithelial cells in the proximal 
tubules of the kidney. In vitro studies of mutant SLC3AI 
and SLC7A9 protein have demonstrated a marked reduction 
in transport activity, providing direct evidence for the role 
of these proteins in cystinuria. 

Cystinosis is a rare disorder caused by a diminished ability 
to transport cystine across the lysosomal membrane. This 
produces an accumulation of cystine crystals in the lysosomes 
of most tissues. Affected persons are normal at birth but 
develop electrolyte disturbances, corneal crystals, rickets, and 
poor growth by the age of 1 year. Renal glomerular damage 
is severe enough to necessitate dialysis or transplantation 
within the first decade of life. Transplanted kidneys function 


normally, but chronic complications such as diabetes melli- 
tus, pancreatic insufficiency, hypogonadism, myopathy, and 
blindness occur. Until recently, treatment was largely sup¬ 
portive, including renal transplantation. However, cysteine- 
depleting agents such as cysteamine have proved successful in 
slowing renal deterioration and improving growth. Long- 
acting slow release cysteamine was recently approved by the 
FDA. Cystinosis is caused by mutations in the gene, CTNS, 
encoding an integral lysosomal membrane protein. 

The phenotype of transport defects is partly 
contingent on the degree of barrier disruption 
as well as the compartments through which 
normal traffic is compromised. Abnormal 
cystine transport between cells and the extra¬ 
cellular environment causes cystinuria, renal 
disease, and hypertension. Cystinosis is pro¬ 
duced by a defect of cystine efflux from the 
lysosome; it leads to severe chronic disabilities 
and, if untreated, death. 

Heavy Metals 

Many of the enzymes controlling metabolic processes require 
additional factors (cofactors) to function properly and effi¬ 
ciently. These cofactors are commonly trace elements such as 
ions of heavy metals (metals that are more dense than those 
in the first two groups of the periodic table). At least 12 trace 
elements are essential to humans. For example, a zinc ion acts 
as a cofactor in carbonic anhydrase, placing a hydroxide ion 
next to carbon dioxide to facilitate the formation of bicar¬ 
bonate. Although an adequate supply of trace elements is 
critical for normal metabolism, excessive amounts of circu¬ 
lating and/or stored heavy metals are highly toxic. Accord¬ 
ingly, a complex series of transport and storage proteins 
precisely control heavy metal homeostasis. Abnormalities 
of these proteins cause progressive dysfunction of various 
organs, often leading to premature death if untreated. Human 
disorders that disrupt the normal homeostasis of copper 
(Wilson disease, Menkes disease, occipital horn syndrome), 
iron (hereditary hemochromatosis; Clinical Commentary 
7-2), and zinc (hereditary acrodermatitis enteropathica) have 
been described. 

Copper 

Copper is absorbed by epithelial cells of the small intestine 
and is subsequently distributed by various chaperone pro¬ 
teins that shuttle it to different places in the cell (e.g., cyto¬ 
plasmic enzymes that use copper as a cofactor, enzymes in 
the mitochondria). Some copper is transported to the liver to 
be incorporated into proteins that distribute it to other parts 
of the body (e.g., brain). Excess copper in hepatocytes is 
secreted into bile and excreted from the body. 

Menkes disease (MND) is an X-linked recessive disorder 
described in 1962 by John Menkes, who studied five male 







CLINICAL COMMENTARY 7-2 
Hereditary Hemochromatosis 

The term hemochromatosis refers to all disorders character¬ 
ized by excessive iron storage. A subgroup of these disorders 
are hereditary and can be caused by mutations in one of several 
different genes. The most common form of hereditary hemo¬ 
chromatosis (HH) is an autosomal recessive disorder of iron 
metabolism in which excessive iron is absorbed in the small 
intestine and then accumulates in a variety of organs such as 
the liver, kidney, heart, joints, and pancreas. It was described 
by von Recklinghausen, the same physician who described 
neurofibromatosis 1, in 1889. Approximately 1 of every 8 north¬ 
ern Europeans is a carrier for HH, and 1 of every 200 to 400 
persons is a homozygote. Although the penetrance of the 
disease-causing genotype is incomplete (as discussed later), 
HH is one of the most common genetic disorders observed in 
people of European ancestry. Its prevalence is much lower in 
Asian and African populations. 

The most common symptom of HH is fatigue, although the 
clinical presentation of patients with HH can vary considerably. 
Additional findings include joint pain, diminished libido, diabe¬ 
tes, increased skin pigmentation, cardiomyopathy, liver enlarge¬ 
ment, and cirrhosis. Abnormal serum iron parameters can 
identify most men at risk for iron overload, but HH is not 
detected in many premenopausal women. The most sensitive 
diagnostic test for HH is a liver biopsy accompanied by histo- 
chemical staining for hemosiderin (a form of stored iron). 

As early as the 1970s, an increased frequency of the human 
leukocyte antigen HLA-A3 allele in HH patients indicated that 
an HH gene might be located near the major histocompatibility 
region (MHC) on chromosome 6p. Subsequent linkage studies 
confirmed this hypothesis in the late 1970s, but it was not until 
1996 that the HH gene was cloned. The HH gene is a widely 
expressed HLA class l-like gene, HFE. The gene product is a 
cell-surface protein that binds to the transferrin receptor (trans¬ 
ferrin carries iron molecules), overlapping the binding site for 
transferrin and inhibiting transferrin-mediated iron uptake. 
However, this does not directly affect iron transport from the 


small intestine. Instead, this interaction is thought to be 
involved in a cell's ability to sense iron levels. This function is 
disrupted in persons with mutations in HFE, resulting in exces¬ 
sive iron absorption from the small intestine and iron overload. 
Thus, hemochromatosis is not caused by a defect of an iron 
transport protein but rather by a defect in the regulation of 
transport. 

A single missense mutation that results in the substitution of 
a tyrosine for cysteine in a p2-microglobulin-binding domain 
accounts for 85% of all HH-causing mutations. A single ances¬ 
tral haplotype predominates in Europeans, suggesting that 
there was a selective advantage conferred by having at least 
one copy of the HH gene. Because iron deficiency affects one 
third of the world's population and is significantly less common 
in HH heterozygotes, it is likely that this explains the higher 
incidence of HH in many populations. 

Treatment of HH consists of reducing the accumulated iron 
in the body. This is accomplished by serial phlebotomy or by 
the use of an iron-chelating agent such as deferoxamine. 
Depending on the quantity of iron stored, return to a normal 
level of iron can take a few years. However, iron reduction 
prevents further liver damage, cures the cardiomyopathy, 
returns skin pigmentation to normal, and might improve the 
diabetes. Persons who have not developed irreversible liver 
damage have a nearly normal life expectancy. 

The estimated penetrance of HH depends on a person's age, 
sex, and whether the presence of disease is measured by 
histological findings such as hepatic fibrosis or clinical symp¬ 
toms. Most men who are homozygous for an HH-causing 
mutation do not develop clinical symptoms, and those who do 
are seldom symptomatic before the age of 40 years. An even 
smaller fraction of homozygous women develop clinical symp¬ 
toms. If symptoms are seen, they typically occur 20 years or 
so later than in men because iron accumulation in women is 
tempered by iron losses during menstruation, gestation, and 
lactation (Fig. 7-1 0). 
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FIG 7-10 Comparison of hemosiderin stain of normal liver (A) with hemosiderin stain of livers 
from persons affected with hemochromatosis (B, C, and D). Note the varying degree of increased 
deposition of hemosiderin in livers of HH homozygotes. This damages the liver, impairs its func¬ 
tion, and can lead to cirrhosis and liver cancer. 
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siblings, all of whom died before 3 years of age. MND 
is characterized by intellectual disability, seizures, hypother¬ 
mia, twisted and hypopigmented hair (pili torti), loose skin, 
arterial rupture, and death in early childhood. In MND 
patients, copper can be absorbed by the gastrointestinal epi¬ 
thelium but cannot be exported effectively from these cells 
into the blood stream. Consequently, when the intestinal cells 
slough, the trapped copper is excreted from the body. The 
lack of available copper leads to an overall deficiency of 
copper. 

Copper is a required cofactor in tyrosinase, lysyl oxidase, 
superoxide dismutase, cytochrome c oxidase, and dopamine 
P-hydroxylase. Reduced availability of copper leads to 
impaired enzyme function, explaining the major clinical fea¬ 
tures of MND. For example, lysyl oxidase is required for 
cross-linking collagen and elastin; therefore, ineffectual cross- 
linking leads to weakened vascular walls and laxity of the skin. 
Treatment of MND consists of restoring copper levels in the 
body to normal. Because copper cannot be absorbed via the 
gastrointestinal tract in MND patients, it must be adminis¬ 
tered by an alternative route such as subcutaneous injections. 
Patients treated this way have demonstrated some clinical 
improvement. However, none of the abnormalities is com¬ 
pletely corrected or prevented. Based on studies in an animal 
model of MND, the brindled mouse, it has been proposed 
that treatment for MND would be most effective if it were 
started in mid-gestation. Consequently, prenatal therapy is 
being investigated. 

In contrast to MND, in which a deficiency of copper causes 
disease, Wilson disease (WND) results from an excess of 
copper caused by defective excretion of copper into the biliary 
tract. This causes progressive liver disease and neurological 
abnormalities. WND, an autosomal recessive disorder, was 
first described by Kinnear Wilson in 1912 and was called 
hepatolenticular degeneration because of end-stage destruc¬ 
tion of the liver and brain. Only in the 1940s did it become 
clear that these findings were due to the accumulation of 
copper. Further progress in understanding the defect under¬ 
lying WND was not made until the 1990s. 

Patients with WND usually present with acute or chronic 
liver disease in childhood. If WND is left untreated, the liver 
disease is progressive, resulting in liver insufficiency, cirrhosis, 
and failure. Adults commonly develop neurological symp¬ 
toms such as dysarthria (the inability to correctly articulate 
words) and diminished coordination. Accumulation of 
copper can also cause arthropathy (inflammation of the 
joints), cardiomyopathy (abnormal function of the heart 
muscle), kidney damage, and hypoparathyroidism (dimin¬ 
ished secretion of or response to parathormone). Deposition 
of copper in Descemet’s membrane (at the limbus of the 
cornea) produces a characteristic finding in the eye (the 
Kayser-Fleischer ring), which is observed in 95% of all WND 
patients and 100% of WND patients with neurological 
symptoms. 

Biochemical testing can be used to confirm the diagnosis 
of WND. Findings include decreased serum ceruloplasmin, 
increased serum nonceruloplasmin copper, increased urinary 


copper excretion, and increased deposition of copper in the 
liver. The most sensitive indicator of WND is the reduced 
incorporation of isotopes of copper into cells cultured in 
vitro. Treatment of WND consists of reducing the load of 
accumulated copper through the use of chelating agents such 
as penicillamine and trientine. Zinc salts are effective chelat¬ 
ing agents for maintenance of copper levels. 

In 1993 the gene responsible for MND, ATP7A, was 
cloned. It encodes an adenosine triphosphatase with six 
tandem copies of a heavy-metal-binding sequence homolo¬ 
gous to previously identified bacterial proteins that confer 
resistance to toxic heavy metals. The high sequence conserva¬ 
tion between human and bacterial binding sequences 
indicated that ATP7A has an important role in regulating 
heavy-metal ion transport. ATP7A is expressed in a variety 
of tissues but not the liver, suggesting that a similar gene 
that is expressed in the liver might cause WND. Portions 
of the ATP7A gene were used as a probe to test for the pres¬ 
ence of a similar gene on chromosome 13 (the known loca¬ 
tion of a WND gene, revealed by linkage analysis). This 
strategy led to the cloning in 1993 of ATP7B, the gene that, 
when mutated, causes WND. The protein product is highly 
homologous (76% amino acid homology) to that of ATP7A. 
In contrast to ATP7A, ATP7B is expressed predominantly in 
the liver and kidney, which are major sites of involvement 
in WND. 

The ATP7A protein is usually localized to the Golgi 
network within a cell, where it supplies copper to various 
enzymes. When copper levels in an epithelial cell of the small 
intestine exceed a certain concentration, ATP7A redistributes 
to the plasma membrane and pumps copper into the blood 
stream. When copper levels drop, ATP7A returns to the Golgi 
network. Thus, it mediates the efflux of copper into the blood 
stream. ATP7A is also an important transporter of copper 
across the blood-brain barrier. 

A variety of missense, nonsense, and splice site mutations 
in ATP7A have been found in MND patients. Approximately 
15% to 20% of the mutations in ATP7A are large deletions. 
Several mutations of splice sites in ATP7A have been associ¬ 
ated with another disorder called the occipital horn syndrome 
(also known as X-linked cutis laxa or Ehlers-Danlos syn¬ 
drome type IX), which is characterized by mild intellectual 
disability, bladder and ureteric diverticula (cul-de-sac-like 
herniations through the wall), skin and joint laxity, and ossi¬ 
fied occipital (the most posterior bone of the calvarium) 
horns. These mutations permit the production of a small 
amount of normal protein and prevent development of 
severe neurological symptoms. 

ATP7B plays a similar role as an effector of copper trans¬ 
port. However, it moves between the Golgi network and 
either endosomes or the cell membrane of hepatocytes, and 
it controls the excretion of copper into the biliary tree. ATP7B 
also aids in the incorporation of copper into ceruloplasmin. 
Several hundred different mutations have been described in 
patients with WND. A single missense mutation accounts for 
about 40% of disease-causing alleles in persons of northern 
European ancestry. 
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Wilson disease (WND) is an autosomal reces¬ 
sive disorder characterized by progressive 
liver disease and neurological abnormalities. 
Menkes disease (MND) is an X-linked reces¬ 
sive disorder characterized by intellectual dis¬ 
ability, seizures, and death in early childhood. 
Accumulation of excess copper causes disease 
in WND, and MND results from a lack of 
copper and impaired enzyme function. WND 
and MND are caused by mutations in the 
highly homologous genes, ATP7B and ATP7A, 
respectively. 

Zinc 

Acrodermatitis enteropathica (AE) is caused by a defect in the 
absorption of zinc from the intestinal tract. Persons with AE 
experience growth retardation, diarrhea, dysfunction of the 
immune system, and a severe dermatitis (inflammation of the 
skin) that typically affects the skin of the genitals and but¬ 
tocks, around the mouth, and on the limbs (Fig. 7-11). Chil¬ 
dren usually present after weaning, and AE can be fatal if it 
is not treated with high doses of supplemental zinc, which is 
curative. AE is caused by mutations in SLC39A4, which 
encodes a putative zinc-transporter protein expressed on the 
apical membrane of the epithelial cell of the small intestine. 
It is unknown whether persons with AE can still absorb small 
amounts of zinc through a mutant form of this transporter 
or whether another transporter exists that can also transport 
zinc when it is given in high doses. 



FIG 7-11 A child with acrodermatitis enteropathica caused 
by mutations in SLC39A4, encoding a protein necessary for 
intestinal absorption of zinc. The resulting deficiency of zinc 
produces a characteristic scaly, red rash around the mouth, 
genitals, buttocks, and limbs. (Courtesy Dr. Virginia Sybert, 
University of Washington.) 



I STUDY QUESTIONS 


1. Garrod found alkaptonuria to be more common in 
the offspring of consanguineous matings. Explain this 
finding. In general, what is the association between the 
coefficient of relationship and the prevalence of an 
inborn error of metabolism? 

2. If many metabolic reactions can proceed in the absence 
of an enzyme, explain how diminished or absent enzyme 
activity can produce disease. 

3. Despite the low prevalence of most metabolic disorders, 
why is it important to understand the pathogenesis of 
inborn errors of metabolism? 

4. Describe three types of metabolic processes and give 
examples of disorders disturbing each of them. 

5. Galactosemia is diagnosed in a 1-week-old neonate, 
yet enzymatic activity of GAL-l-P uridyl transferase is 
normal. Interpret these results and explain how mutations 
of different genes can produce a similar phenotype. 

6 . The prevalence of PKU varies from 1 in 10,000 to less 
than 1 in 100,000. Explain how prevalence rates of inborn 


errors of metabolism can vary so widely among different 
ethnic groups. 

7. An 18-year-old woman comes to your office for prenatal 
counseling. She had a younger brother who died from a 
defect of mitochondrial fatty acid oxidation when he was 
a few months old. What is her risk of having an affected 
child? Explain your answer. 

8 . An 8-year-old girl develops hyperammonemia and is 
critically ill. Biochemical testing performed on a liver 
biopsy confirms that she has OTC deficiency. Which 
genetic test would you order next? Why? 

9. Disorders of the OXPHOS system are commonly associ¬ 
ated with elevated blood lactic acid levels. Explain this 
finding. 

10. Polymorphisms that affect metabolism presumably have 
been maintained by selection because they offer hetero¬ 
zygotes a slight advantage. Provide an example of how 
these polymorphisms might have been advantageous to 
a group of hunter-gatherers 10,000 years ago. 
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CHAPTER 


Disease-Gene Identification 


The identification of mutations that cause disease is a central 
focus of medical genetics. With the completion of the Human 
Genome Project (see Box 8-2 later in this chapter), the loca¬ 
tions of virtually every human gene in the genome are now 
known. The availability of these data, along with dramatic 
advances in molecular genetic technology and important 
developments in the statistical analysis of genetic data, 
has greatly expedited the discovery of disease-causing muta¬ 
tions. However, the specific genetic alterations responsible for 
many inherited disease phenotypes remain unknown (i.e., 
which gene or genes contribute to which disease). Much 
research is now devoted to discovering these genetic muta¬ 
tions and their consequences. As this work progresses, our 
understanding of the biological basis of genetic disease will 
continue to increase. 

Identifying disease-causing genes is an important step in 
the understanding, diagnosis, and eventual treatment of a 
genetic disease. When a disease-causing gene’s location has 
been pinpointed, it is often possible to provide a more accu¬ 
rate prognosis for persons at risk for a genetic disease. The 
DNA sequence that makes up the gene can be analyzed, and 
its RNA and/or protein product can be studied. This can 
contribute to our understanding of the cause of the disease. 
Furthermore, discovery of a disease-associated gene can open 
the way to modification of the abnormal gene product (e.g., 
preventing its expression) or the manufacture of a normal 
gene product through recombinant DNA techniques, permit¬ 
ting more effective treatment of a genetic disease. For example, 
recombinant clotting factor VIII is used to treat hemophilia 
A, as discussed in Chapter 5, and recombinant insulin is used 
in the treatment of type 1 diabetes. Gene therapy—modifying 
genes of persons with a genetic disease—also becomes a pos¬ 
sibility. Thus, disease-gene discovery contributes directly to 
many of the primary goals of medical genetics. 

This chapter discusses the approaches commonly used in 
gene mapping and identification. Frequently, this process 
begins by mapping mutations that occur in affected persons 
to specific locations on chromosomes. Two major types of 
gene mapping can be distinguished. In gene mapping, the 
frequency of meiotic crossovers between loci is used to esti¬ 
mate distances between loci. Physical mapping involves 
using cytogenetic, molecular, and computational methods to 
determine the physical locations of disease-causing muta¬ 
tions on chromosomes. 


GENE MAPPING _ 

Linkage Analysis 

One of Gregor Mendel’s laws, the principle of independent 
assortment, states that an individual’s genes will be transmit¬ 
ted to the next generation independently of one another (see 
Chapter 4). Mendel was not aware that genes are located on 
chromosomes and that genes located near one another on the 
same chromosome are transmitted together rather than inde¬ 
pendently. Thus, the principle of independent assortment 
holds true for most pairs of loci, but not for those that occupy 
the same region of a chromosome. Such loci are said to be 
linked. 

Figure 8-1 depicts two loci, A and B, that are located close 
together on the same chromosome. A third locus, C, is located 
on another chromosome. In the individual in our example, 
each of these loci has two alleles, designated 1 and 2. A and 
B are linked, so A, and Bj are inherited together. Because A 
and C are on different chromosomes and thus unlinked, their 
alleles do follow the principle of independent assortment. 
Hence, if the process of meiosis places Aj in a gamete, the 
probability that Cj will be found in the same gamete is 50%. 

Recall from Chapter 2 that homologous chromosomes 
sometimes exchange portions of their DNA during prophase 
I (this is known as crossing over or crossover). The average 
chromosome experiences one to three crossover events 
during meiosis. As a result of crossover, new combinations of 
alleles can be formed on a chromosome. Consider again the 
linked loci, A and B, in Figure 8-1. Alleles Aj and Bj are 
located close together on one chromosome, and alleles A 2 and 
B 2 are located on the homologous chromosome. The combi¬ 
nation of alleles on each chromosome is a haplotype (from 
“haploid genotype”). The two haplotypes of this individual 
are denoted A 1 B 1 /A 2 B 2 . As Figure 8-2A shows, in the absence 
of crossover AjBj will be found in one gamete and A 2 B 2 in 
the other. But when there is a crossover, new allele combina¬ 
tions, AjB 2 and A 2 BJ, will be found in the two gametes (see 
Fig. 8-2B). The process of forming such new arrangements of 
alleles is called recombination. Crossover does not necessar¬ 
ily lead to recombination, however, because a double cross¬ 
over can occur between two loci, resulting in no recombination 
(see Fig. 8-2C). 

As Figure 8-3 shows, crossovers are more likely to occur 
between loci that are situated far apart on a chromosome 
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than between loci that are situated close together. Thus, the 
distance between two loci can be inferred by estimating how 
frequently recombinations occur in families (this is called the 
recombination frequency). If, in a large series of meioses 
studied in families, the alleles of A and B undergo recombina¬ 
tion 5% of the time, then the recombination frequency for A 
and B is 5%. 

The genetic distance between two loci is measured in cen- 
timorgans (cM), in honor of T. H. Morgan, who discovered 
the process of crossing over in 1910. One cM is approximately 
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FIG 8-1 Loci A and B are linked on the same chromosome, 
so alleles A, and B, are usually inherited together. Locus C 
is on a different chromosome, so it is not linked to A and 6 , 
and its alleles are transmitted independently of the alleles of 
A and B. 


equal to a recombination frequency of 1%. The relationship 
between recombination frequency and genetic distance is 
approximate, because double crossovers produce no recom¬ 
bination. The recombination frequency thus underestimates 
map distance, especially as the recombination frequency 
increases above about 10%. Mathematical formulae have 
been devised to correct for this underestimate. 

Loci that are on the same chromosome are said to be 
syntenic (meaning “same thread”). If two syntenic loci are 50 
cM apart, they are considered to be unlinked. This is because 
their recombination frequency, 50%, is equivalent to inde¬ 
pendent transmission, as in the case of alleles of loci that are 
on different chromosomes. (To understand this, think of the 
chromosomes shown in Figure 8-1: If a person transmits 
allele Aj, the probability that he or she also transmits allele 
Cl, which is on another chromosome, is 50%, and the prob¬ 
ability that he or she transmits allele C 2 is also 50%.) 

Crossovers between loci on the same chromo¬ 
some can produce recombination. Loci on the 
same chromosome that experience recombi¬ 
nation less than 50% of the time are said to 
be linked. The distance between loci can be 
expressed in centimorgans (cM); 1 cM repre¬ 
sents a recombination frequency of approxi¬ 
mately 1%. 
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FIG 8-2 The genetic results of crossover. A, No crossover: A, and 6 , remain together after 
meiosis. B, A crossover between A and B results in a recombination: A, and 62 are inherited 
together on one chromosome, and A^ and 6 , are inherited together on another chromosome. 
C, A double crossover between A and B results in no recombination of alleles. (Modified from 
McCance KL, Huether SE: Pathophysiology. The Biologic Basis for Disease in Adults and Chil¬ 
dren. 5th ed. St Louis: Mosby; 2005.) 
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RECOMBINANTS 

FIG 8-3 Crossover is more likely between loci that are far 
apart on chromosomes (left) than between those that are 
close together (right). 


Recombination frequencies can be estimated by observing 
the transmission of genes in pedigrees. Figure 8-4 is an 
example of a pedigree in which neurofibromatosis type 1 
(NFl) is being transmitted. The members of this pedigree 
have also been typed for a two-allele single nucleotide poly¬ 
morphism (SNP) which, like the NPl gene, is located on 
chromosome 17. The SNP genotypes are shown below each 
individual’s number in the pedigree. Examination of 


generations I and II allows us to determine that, under the 
hypothesis of linkage between NFl and the SNP, the disease- 
causing mutation in the NFl gene must be on the same copy 
of chromosome 17 as allele 1 of the SNP in this family, 
because individual 1-2, who is homozygous for allele 2, is 
unaffected with the disease. Only the affected father (I-l), 
who is a heterozygote for the SNP, could have transmitted a 
copy of chromosome 17 that contains both the NPl disease 
allele and SNP allele 1 to the daughter (II-2). 

The arrangement of these alleles on each chromosome is 
referred to as linkage phase. With the linkage phase known, 
individual II-2's haplotypes would then be Nl/n2, where N 
indicates the mutated allele causing NFl, n indicates the 
normal allele, and 1 and 2 are the two SNP alleles (in other 
words, individual II-2 has one copy of chromosome 17 that 
contains both the disease-causing mutation N and SNP allele 
1, and her other copy of chromosome 17 contains the normal 
allele n and SNP allele 2). This woman’s husband (individual 
II-1) is not affected with the disease and is a homozygote for 
SNP allele 2. Fie must have the haplotypes n2/n2. If the NFl 
locus and the SNP are linked, the children of this union who 
are affected with NFl should usually have SNP allele 1, and 
those who are unaffected should have allele 2. In seven of 
eight children in generation III, we find this to be true. In one 
case, a recombination occurred (individual III-6). This gives 
a recombination frequency of 1/8, or 12.5%, supporting the 
hypothesis of linkage between NFl and the SNP. A recombi¬ 
nation frequency of 50% would support the hypothesis that 
the two loci are not linked. Note that the pedigree allows us 
to determine linkage phase in individual II-2, but we cannot 
determine whether a recombination took place in the gamete 
transmitted to II-2 by her father. Thus, the recombination 
frequency is estimated only in the descendants of II-2. In 
actual practice, a much larger sample of families would be 
studied to ensure the statistical accuracy of this result. 

Estimates of recombination frequencies are 
obtained by observing the transmission of 
alleles in families. Determination of the linkage 
phase (i.e., the chromosome copy on which 
each allele is located) is an important part of 
this procedure. 

Polymorphisms that are used to follow a disease-causing 
allele through a family are termed markers (i.e., they can 
mark the chromosome on which a disease-causing allele is 
located). Because linked markers can be typed in an indi¬ 
vidual of any age (even in a fetus), they are useful for the early 
diagnosis of genetic disease (see Chapter 13). It is important 
to emphasize that a marker locus simply helps us to deter¬ 
mine which member of a chromosome pair is being transmit¬ 
ted by a parent; the actual cause of the genetic disease is 
usually a nearby mutation which may be identified in subse¬ 
quent DNA sequencing analysis. 

In general, 1 cM corresponds to approximately 1 million 
base pairs (1 Mb) of DNA. Flowever, this is only an approxi¬ 
mate relationship, because several factors are known to 
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FIG 8-4 A neurofibromatosis type 1 pedigree in which each 
member has been typed for a two-allele single nucleotide 
polymorphism (SNP). Genotypes for this two-allele marker 
locus are shown below each individual in the pedigree. 
Affected pedigree members are indicated by a shaded 
symbol. 


influence crossover rates. First, crossovers are roughly 1.5 
times more common during female meiosis (oogenesis) than 
during male meiosis (spermatogenesis). Also, crossovers tend 
to be especially common near the telomeres of chromosomes. 
Finally, some small chromosome regions (one to a few kb in 
size) exhibit crossover rates that are at least 10-fold higher 
than elsewhere in the genome. The human genome contains 
thousands of these recombination hot spots, and they 
account for more than half of all recombination events. 

Although there is a correlation between centimorgans and 
actual physical distances between loci, this relationship is 
complicated by sex differences in recombination, higher 
recombination frequencies near telomeres, and the existence 
of recombination hot spots. 

LOD Scores: Determining the Significance 
of Linkage Results 

As in any statistical study, we must be careful to ensure that 
the results obtained in a linkage analysis are not due simply 
to chance. For example, consider a two-allele marker locus 
that has been typed in a pedigree. It is possible by chance for 
all affected offspring to inherit one allele and for all unaf¬ 
fected offspring to inherit the other allele even if the marker 
is not linked to the disease-causing gene. This misleading 
result becomes less likely as we increase the number of sub¬ 
jects in our linkage study (just as the chance of a strong 
deviation from a 50/50 heads-to-tails ratio becomes smaller 
when we toss a coin many times). 

Fiow do we determine whether a linkage result is likely to 
be due to chance alone? We begin by comparing the likeli¬ 
hood (a likelihood is similar in concept to a probability) that 
two loci are linked at a given recombination frequency 
(denoted 0) versus the likelihood that the two loci are not 
linked (recombination frequency = 50%, or 9 = 0.5). Suppose 
we wish to test the hypothesis that two loci are linked at a 
recombination frequency of 9 = 0.1 versus the hypothesis that 
they are not linked. We use our pedigree data to form a likeli¬ 
hood ratio: 

likelihood of observing pedigree data if 9 = 0.1 
likelihood of observing pedigree data if 9 = 0.5 


If our pedigree data indicate that 9 is more likely to be 0.1 
than 0.5, then the likelihood ratio (or odds) will be greater 
than 1.0. If, however, the pedigree data argue against linkage 
of the two loci, then the denominator will be greater than the 
numerator, and the ratio will be less than 1.0. For conve¬ 
nience, the common logarithm’^ of the ratio is usually taken; 
this logarithm of the odds is termed a LOD score. Convention¬ 
ally, a LOD score of 3.0 or more is accepted as evidence of 
linkage; a score of 3.0 indicates that the likelihood in favor 
of linkage is 1000 times greater than the likelihood against 
linkage. Conversely, a LOD score lower than -2.0 (odds of 
100 to 1 against linkage) is considered to be evidence that two 
loci are not linked. Box 8-1 provides details on the calculation 
of LOD scores. 

The statistical odds that two loci are a given 
number of centimorgans apart can be calcu¬ 
lated by measuring the ratio of two likelihoods: 
the likelihood of linkage at a given recombina¬ 
tion frequency divided by the likelihood of no 
linkage. The logarithm of this odds ratio is a 
LOD score. LOD scores of 3.0 or higher are 
taken as evidence of linkage, and LOD scores 
lower than -2.0 are taken as evidence that the 
two loci are not linked. 

Linkage Analysis and the Human Gene Map 

Suppose that we are studying a disease-causing gene in a 
series of pedigrees, and we wish to map it to a specific chro¬ 
mosome location. Typically, we would type the members of 
our pedigree for marker loci whose locations on each chro¬ 
mosome have been established. Using the techniques just 
described, we test for linkage between the disease-causing 
gene and each marker. Most of these tests would yield nega¬ 
tive LOD scores, indicating no linkage between the marker 


"^Recall that the common logarithm (logio) of a number is the power 
to which 10 is raised to obtain the number. The common logarithm 
of 100 is 2, the common logarithm of 1000 is 3, and so on. 
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BOX 8-1 Estimating LOD Scores in Linkage Analysis 


A simple example will help to illustrate the concepts of likeli¬ 
hood ratios and LOD scores. Consider the pedigree diagram in 
the figure below, which illustrates another family in which NF1 
is being transmitted. The family has been typed for a two-allele 
SNP, as in Figure 8-4. The male in generation II must have 
received SNP allele 7 from his mother, because she can trans¬ 
mit only this marker allele. Thus, his copy of allele 2 had to 
come from his father, on the same chromosome copy as the 
NF1 disease gene (under the hypothesis of linkage). This 
allows us to establish linkage phase in this pedigree: The 
affected male in generation II must have the haplotypes N2/ 
n1. He marries an unaffected woman who is a homozygote for 
allele 2. Thus, the hypothesis of close linkage (6 = 0.0) predicts 
that each child in generation III who receives allele 2 from his 
or her father must also receive the NF1 disease allele. Under 
the hypothesis of linkage, the father can transmit only two 
possible combinations: either the chromosome copy that 
carries both the disease-causing gene and allele 2 (N2 haplo- 
type) or the other chromosome copy, which has the normal 
gene and SNP allele 7 (haplotype n1). The probability of each 
of these events is 1/2. Therefore, if 0 = 0.0, the probability of 
observing five children with the genotypes shown in the figure 
below is (1/2)’’, or 1/32 (i.e., the multiplication rule is applied to 
obtain the probability that all five of these events will occur 
together). This is the numerator of the likelihood ratio. 



1,2 2,2 2,2 1,2 1,2 
An NFl pedigree in which each member has been typed 
for a 2-allele single nucleotide polymorphism (SNP). The 
SNP marker genotypes are shown below each individual in 
the pedigree. 

Now consider the likelihood of observing these genotypes if 
the SNP and NFl are not linked (0 = 0.5). Under this hypothesis, 
there is independent assortment of the two SNP alleles and 
NFl. The father could transmit any of four combinations (N1, 
N2, n1, and n2) with equal probability (1/4). The probability of 
observing five children with the observed genotypes would 
then be (1/4)® = 1/1024. This likelihood is the denominator of 
the likelihood ratio. The likelihood ratio is then 1/32 divided by 
1/1024, or 32. Thus, the data in this pedigree tell us that 
linkage, at 0 = 0.0, is 32 times more likely than nonlinkage. 

If we take the common logarithm of 32, we find that the LOD 
score is 1.5, which is still far short of the value of 3.0 usually 
accepted as evidence of linkage. To prove linkage, we would 
need to examine data from additional families. LOD scores 
obtained from individual families can be added together to 
obtain an overall score. (Note that, mathematically, adding LOD 


scores is the same as multiplying the odds of linkage in each 
family together and then taking the logarithm of the result. This 
is another example of using the multiplication rule to assess 
the probability of co-occurrence.) 

Suppose that a recombination had occurred in the meiosis 
producing III-5, the fifth child in generation III (i.e., she would 
retain the same marker genotype but would be affected with 
the disease rather than unaffected). This event is impossible 
under the hypothesis that 0 = 0.0, so the numerator of the 
likelihood ratio becomes zero, and the LOD score for 0 = 0.0 
is -oo. It is possible, however, that the marker and disease loci 
are still linked, but at a recombination frequency greater than 
zero. Let us test, for example, the hypothesis that 0 = 0.1. This 
hypothesis predicts that the disease allele, N, will be transmit¬ 
ted along with marker allele 2 90% of the time and with SNP 
allele 7 10% of the time (i.e., when a recombination has 
occurred). By the same reasoning, the normal allele, n, will be 
transmitted with SNP allele 7 90% of the time and with SNP 
allele 210% of the time. As in the previous example, the father 
can transmit either the normal allele or the disease allele with 
equal probability (0.5) to each child. Thus, the probability of 
inheriting the disease allele with SNP allele 2 (haplotype A/2) is 
0.5 X 0.90 = 0.45, and the probability of inheriting the disease 
allele with SNP allele 7 (haplotype N1) is 0.5 x 0.1 = 0.05. The 
probability of inheriting the normal allele with marker 7 (n1) is 
0.45, and the probability of inheriting the normal allele with 
marker 2 (n2) is 0.05. In either case, then, the probability of 
receiving a nonrecombination (N2 or n1) is 0.45, and the prob¬ 
ability of receiving a recombination (N1 or n2) is 0.05. We know 
that four of the children in generation III are nonrecombinants, 
and each of these events has a probability of 0.45. We know 
that one individual is a recombinant, and the probability of this 
event is 0.05. The probability of four nonrecombinations and 
one recombination occurring together in generation III is 
obtained by applying the multiplication rule: 0.45'' x 0.05. This 
becomes the numerator for our LOD score calculation. As 
before, the denominator (the likelihood that 0 = 0.5) is (1/4)®. 
The LOD score for 0 = 0.1, then, is given by logio[(0.45'' x 0.05)/ 
(1/4)®] = 0.32. 

To test the hypothesis that 0 = 0.2, the approach just outlined 
is used again, with 0 = 0.2 instead of 0 = 0.1. This yields a LOD 
score of 0.42. It makes sense that the LOD score for 0 = 0.2 
is higher than that for 0 = 0.1, because we know that one of 
five children (0.2) in generation III is a recombinant. Applying 
this formula to a series of possible 0 values (0, 0.1, 0.2, 0.3, 
0.4, and 0.5) shows that 0.2 yields the highest LOD score, as 
we would expect: 


0 

0 
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LOD 
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0.32 
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0.0 


Sometimes the linkage phase in a pedigree is not known. For 
example, if the grandparents in the figure above had not been 
typed, we would not know the linkage phase of the father in 
generation II. It is equally likely that his haplotypes are N2/n1 
or N1/n2 (i.e., each combination has a probability of 1/2). Thus, 
we need to take both possibilities into account. If he has the 
N2/n1 haplotypes, then the first four children are nonrecombi¬ 
nants, each with a probability of (1 - 0)/2, and the fifth child is 
a recombinant, with a probability of 0/2 (using the reasoning 
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BOX 8-1 Estimating LOD Scores in Linkage Analysis—cont'd 


outlined previously). Under the hypothesis that 9 = 0.1, the 
overall probability that the father has haplotypes N2/nl and that 
the five children have the observed genotypes is 1/2(0.45'' x 
0.05) = 0.001. We now need to take the alternative phase into 
account (i.e., that the father has haplotypes N1/n2). Here, the 
first four children would each be recombinants, with probability 
9/2, and only the fifth child would be a nonrecombinant, with 
probability (1 - 9)/2. The probability that the father has the A/7/ 
n2 haplotypes and that the children have the observed geno¬ 
types is 1/2(0.45 X 0.05") = 0.000001. This probability is con¬ 
siderably smaller than the probability of the previous phase, 
which makes sense when we consider that under the hypoth¬ 
esis of linkage at 9 = 0.1, four of five recombinants is an unlikely 
outcome. We can now consider the probability of either linkage 
phase in the father by adding the two probabilities together: 
1/2(0.45" X 0.05) + 1/2(0.45 x 0.05"). This becomes the numera¬ 
tor for the LOD score calculation. As before, the denominator 
(i.e., the probability that 9 = 0.5) is simply (1/4)" = 1/1024. Then, 
the total LOD score for unknown linkage phase at 9 = 0.1 is 
logio[(1/2[0.45" X 0.051 + 1/2(0.45 x 0.05"1)/(1/1024)] = 0.02. As 
before, we can estimate LOD scores for each recombination 
frequency: 


9 

0 
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0.02 

0.12 
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0.0 


Notice that each LOD score is lower than the corresponding 
score when linkage phase is known. This follows from the fact 
that a known linkage phase contributes useful information to 
allow more accurate estimation of the actual genotypes of the 
offspring. 

LOD scores are often graphed against the 9 values, as shown 
in the figure below. The highest LOD score on the graph Is the 

maximum likelihood estimate of 9. That is, it is the most 
likely distance between the two loci being analyzed. 
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The logarithm of the odds (LOD) score (y-axis) is plotted 
against the recombination frequency (x-axisj to demon¬ 
strate the most likely recombination frequency for a pair 
of loci. 


In practice, the analysis of human linkage data is not as 
simple as in these examples. Penetrance of the disease- 
causing gene may be incomplete, recombination frequencies 
differ between the sexes, and the mode of inheritance of the 
disease may be unclear. Consequently, linkage data are ana¬ 
lyzed using one of several available computer software pack¬ 
ages. Many of these packages also allow one to carry out 
multipoint mapping, an approach In which the map locations 
of several markers are estimated simultaneously. 


and the disease-causing gene. Eventually this exercise will 
reveal linkage between the disease-causing gene and a marker 
or group of markers. Because of the large size of the human 
genome, thousands of markers are typically evaluated to find 
one or several that are linked to the disease-causing gene. 
Many important hereditary diseases have been localized 
using this approach, including cystic fibrosis, Huntington 
disease, Marfan syndrome, and neurofibromatosis type 1 
(NFl). 

Until the 1980s, linkage analyses had little chance of 
success because there were only a few dozen useful polymor¬ 
phic markers in the entire human genome. Thus, it was 
unlikely that a disease-causing gene would be located near 
enough to a marker to yield a significant linkage result. This 
situation changed dramatically after thousands of new poly¬ 
morphic markers were discovered throughout the genome 
(see Chapter 3). With efficient genotyping techniques and 
large numbers of markers, it is now commonplace to map a 
disease-causing gene very quickly. 


To be useful for gene mapping, marker loci should be 
highly polymorphic (i.e., the locus has many different alleles 
in the population). A high degree of polymorphism ensures 
that most parents will be heterozygous for the marker locus, 
making it easier to establish linkage phase in families. STRs 
typically have many alleles and are easy to assay; they are thus 
especially well suited to gene mapping. In addition, marker 
loci should be numerous, so that close linkage to the disease- 
causing gene is likely. Thousands of STRs and millions of 
SNPs have now been identified throughout the genome, so 
this requirement has been fulfilled. Each chromosome is now 
saturated with markers (Eig. 8-5). 

An example illustrates the importance of numerous, poly¬ 
morphic markers. Consider the pedigree in Figure 8-6A. The 
affected man is a homozygote for a two-allele marker locus 
that is closely linked to the disease locus. The man’s wife is a 
heterozygote for the marker locus. Their affected daughter is 
homozygous for the marker locus. Based on these genotypes, 
it is impossible to determine linkage phase in this generation. 
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FIG 8-5 A genetic map of chromosome 9, showing the locations of a large number of 
polymorphic markers. Because recombination rates are usually higher in female meiosis, the 
distances between markers (in centimorgans) are larger for females than for males. 
(From Attwood J, Chiano M, Collins A, et al. CEPH consortium map of chromosome 9. Genomics 
1994:19:203-214.) 


so we cannot predict which children will be affected with the 
disorder and which will not. The mating in generation I is 
called an uninformative mating. In contrast, a marker 
locus with six alleles has been typed in the same family 
(see Fig. 8-6B). Because the mother in generation 1 has two 
alleles that differ from those of the affected father, it is now 
possible to determine that the affected daughter in generation 
11 has inherited the disease-causing allele on the same copy 
of the chromosome that contains marker allele 1. Because she 
married a man who has alleles 4 and 5, we can predict that 
each offspring who receives allele 1 from her will be affected, 
and each one who receives allele 2 will be unaffected. 
Exceptions will be due to recombination. This example 


demonstrates the value of highly polymorphic markers for 
both linkage analysis and diagnosis of genetic disease (see 
Chapter 13). 

To be useful in gene mapping, linked markers 
should be numerous and highly polymorphic. 
A high degree of polymorphism in the marker 
locus increases the probability that matings 
will be informative for linkage analysis. 

The availability of many highly polymorphic markers 
throughout the genome helps researchers to narrow the loca¬ 
tion of a gene by direct observation of recombinations within 
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1,1 1,2 1,2 1,1 1,4 2,4 1,5 2,4 


A Uninformative mating B Informative mating 

FIG 8-6 An autosomal dominant disease-causing gene is segregating in this family. A, A closely 
linked two-allele marker polymorphism has been typed for each member of the family, but 
linkage phase cannot be determined (uninformative mating). B, A closely linked six-allele short 
tandem repeat polymorphism (STRP) has been typed for each family member, and linkage phase 
can now be determined (informative mating). 


families. Suppose that a series of marker polymorphisms, 
labeled A, B, C, D, and E, are all known to be closely linked 
to a disease-causing gene. The family shown in Figure 8-7 has 
been typed for each marker, and we observe that individual 
11-2 carries marker alleles A 2 , B 2 , C 2 , D 2 , and E 2 on the same 
copy of the chromosome that contains the disease-causing 
mutation (linkage phase). The other (normal) copy of this 
chromosome in individual II-2 carries marker alleles Aj, 

Cl, Di, and £j. Among the affected offspring in generation III, 
we see evidence of two recombinations. Individual III-2 
clearly inherited marker allele Aj from her affected mother 
(II-2), but she also inherited the disease-causing mutation 
from her mother. This tells us that there has been a recombi¬ 
nation (crossover) between marker A and the disease-causing 
gene. Thus, we now know that the region of the chromosome 
between marker A and the telomere cannot contain the 
disease-causing gene. 

We observe another recombination in the gamete trans¬ 
mitted to individual III-5. In this case, the individual inher¬ 
ited markers Di and Ej but also inherited the disease-causing 
mutation from II-2. This indicates that a crossover occurred 
between marker locus D and the disease-causing locus. We 
now know that the region between marker D and the centro¬ 
mere (including marker E) cannot contain the disease-causing 
locus. These two key recombinations have thus allowed us to 
substantially narrow the region that contains the disease- 
causing locus. Additional analyses in other families could 
narrow the location even further, provided that additional 
recombinations can be observed. In this way, it is often pos¬ 
sible to narrow the location of a disease-causing locus to a 
region that is several centimorgans or so in size. 

Sometimes, a linkage analysis produces a total LOD score 
close to zero. This could mean simply that the pedigrees are 
uninformative (a LOD score of zero indicates that the likeli¬ 
hoods of linkage and nonlinkage are approximately equal. 


because 10“ = 1). However, a total LOD score of zero can also 
result when one subset of families has positive LOD scores 
(indicating linkage) and another subset has negative LOD 
scores (arguing against linkage). This result would provide 
evidence of locus heterogeneity for the disease in question 
(see Chapter 4). For example, osteogenesis imperfecta type I 
may be caused by mutations on either chromosome 7 or 
chromosome 17 (see Chapter 4). A study of families with this 
disease could show linkage to markers on chromosome 17 in 
some families and linkage to chromosome 7 in others. Linkage 
analysis has helped to define locus heterogeneity in a large 
number of diseases, including retinitis pigmentosa, a major 
cause of blindness (Clinical Commentary 8-1). 

The direct observation of recombinations 
between marker loci and the disease-causing 
locus can help to narrow the size of the region 
that contains the disease-causing locus. In 
addition, linkage analysis sometimes shows 
that some affected families demonstrate 
linkage to markers in a given chromosome 
region and others do not. This is an indication 
of locus heterogeneity. 

Linkage Disequilibrium: Nonrandom Association 
of Alleles at Linked Loci 

Within families, one allele of a marker locus will usually be 
transmitted along with the disease-causing allele if the marker 
and disease loci are linked. For example, allele 1 of a linked 
two-allele marker could co-occur with the Huntington disease 
(HD) allele, located on chromosome 4, in a family. This asso¬ 
ciation is part of the definition of linkage. However, if one 
examines a series of families for linkage between HD and the 
marker locus, allele 1 will co-occur with the disease in some 
families, and allele 2 of the marker will co-occur with the 
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FIG 8-7 A family in which markers A, B, C, D, and E have been typed and assessed for linkage 
with an autosomal dominant disease-causing mutation. As explained in the text, recombination 
is seen between the disease locus and marker A in individual III-2 and between the disease 


locus and marker D in individual III-5. 
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CLINICAL COMMENTARY 8-1 

Retinitis Pigmentosa: A Genetic Disorder Characterized by Locus Heterogeneity 


Retinitis pigmentosa (RP) describes a collection of inherited 
retinal defects that together are the most common inherited 
cause of human blindness, affecting 1 in 3000 to 1 in 4000 
persons. The first clinical signs of RP are seen as the rod pho¬ 
toreceptor cells begin to die, causing night blindness. Rod 
electroretinogram (ERG) amplitudes are reduced or absent. 
With the death of rod cells, other tissue begins to degenerate 
as well. Cone cells die, and the vessels that supply blood to 
the retinal membranes begin to attenuate. This leads to a 
reduction in daytime vision. Patients develop tunnel vision, and 
most are legally blind by 40 years of age. The name retinitis 
pigmentosa comes from the pigments that are deposited on 
the retinal surface as pathological changes accumulate (Fig. 
8-8). RP is neither preventable nor curable, but there is evi¬ 
dence that its progress can be slowed somewhat by dietary 
supplementation with vitamin A. 

RP is known to be inherited in different families in an auto¬ 
somal dominant, autosomal recessive, or X-linked recessive 
fashion. These modes of inheritance account for approximately 
30% to 40%, 50% to 60%, and 5% to 15% of RP cases, 
respectively. In addition, a small number of cases are caused 
by mitochondrial mutations, and one form of RP is caused by 
the mutual occurrence of mutations at two different loci 



FIG 8-8 A fundus photograph illustrating clumps of pigment 
deposits and retinal blood vessel attenuation in retinitis 
pigmentosa. (Courtesy Dr. Donnell J. Creel, University of 
Utah Health Sciences Center.) 


{peripherin/RDS and ROM1, both of which encode structural 
components of photoreceptor outer segment disc mem¬ 
branes). This mode of inheritance is termed digenic. Genetic 
studies have demonstrated that mutations in any of 56 differ¬ 
ent genes can cause non-syndromic RP (i.e., RP that does not 
occur as part of an identified syndrome), making this disease 
an example of extensive locus heterogeneity. 

An early linkage analysis mapped an autosomal dominant 
form of RP to the long arm of chromosome 3. This was a 
significant finding, because the gene, RHO, that encodes rho- 
dopsin had also been mapped to this region. Rhodopsin is the 
light-absorbing molecule that initiates the process of signal 
transduction in rod photoreceptor cells. Thus, RHO was a rea¬ 
sonable candidate gene (see text) for RP. Linkage analysis was 
performed using a polymorphism located within the RHO, and 
a LOD score of 20 was obtained for a recombination frequency 
of zero in a large Irish pedigree. Subsequently, more than 100 
different mutations in RHO have been shown to cause RP, 
confirming the role of this locus in causing the disease. RHO 
mutations are estimated to account for 25% of autosomal 
dominant RP cases and about 10% of all RP cases. 

Additional studies have identified mutations in genes involved 
in many different aspects of retinal degeneration. Some of 
these genes encode proteins involved, for example, in visual 
transduction (e.g., rhodopsin, the a subunit of the rod cyclic 
guanosine monophosphate [cGMP] cation-gated channel 
protein, and the a and (3 subunits of rod cGMP phosphodies¬ 
terase): photoreceptor structure (e.g., peripherin/RDS and 
ROMI)] and retinal protein transport (e.g., ABCR). Additional 
genes have been implicated in syndromes that include RP as 
one feature. For example, RP is seen in Leber congenital amau¬ 
rosis (LCA), the most common hereditary visual disorder of 
children. About 10% to 20% of persons with RP have Usher 
syndrome, which has a number of subtypes and typically also 
involves vestibular dysfunction and sensorineural deafness. 
Another 5% of RP cases occur as part of the Bardet-BiedI 
syndrome, in which mental retardation and obesity are also 
observed. 

Collectively, the 56 genes identified thus far in causing RP 
account for about 60% of all disease cases. Additional studies 
of this genetically heterogeneous disorder will doubtless 
uncover additional genes, further enhancing our understanding 
of the causation of this disorder. 


disease in others (Fig. 8-9). This reflects two things. First, 
disease-causing mutations might have occurred numerous 
times in the past, sometimes on a copy of chromosome 4 
carrying marker allele 1 and other times on a copy of chro¬ 
mosome 4 carrying marker allele 2. Second, even if the disease 
is the result of only one original mutation, crossovers occur¬ 
ring through time will eventually result in recombination of 
the marker and disease alleles. A disease-causing allele and a 
linked marker allele will thus be associated within families but 
not necessarily between families. In other words, if we examine 
a marker locus and a disease-causing locus in a large series of 


families in a population, we do not necessarily expect that 
one specific marker allele will be associated with the disease- 
causing mutation in most or all families. 

Sometimes, however, we do observe preferential associa¬ 
tion of a specific marker allele and the disease-causing allele 
in a population. The chromosome haplotype consisting of 
one marker allele and the disease-causing allele is found more 
often than we would expect based on the frequencies of the 
two alleles in the population. Suppose, for example, that the 
disease-causing allele has a frequency of 0.1 in the population 
and the frequencies of the two alleles (labeled 1 and 2) of the 
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A 1,1 1,2 1,1 1,1 1,2 B 1,2 1,1 1,2 1,1 

FIG 8-9 A, In this family, allele 1 of a marker polymorphism is in linkage phase with the disease- 
causing allele (i.e., both alleles are on the same copy of chromosome 4). B, In a second family, 
allele 2 of the same marker polymorphism is in linkage phase with the disease-causing allele. 
This difference in families can reflect an earlier recombination event between the marker locus 
and the disease-causing locus, or it can reflect the occurrence of two different mutation events 
in the ancestors of the two families. 


marker locus are 0.4 and 0.6, respectively. Assuming statistical 
independence between the two loci (i.e., linkage equilib¬ 
rium), the multiplication rule predicts that the population 
frequency of the haplotype containing both the disease- 
causing allele and marker allele 1 would be 0.1 X 0.4 = 0.04. 
By collecting family information, we can directly count the 
haplotypes in the population. If we find that the actual fre¬ 
quency of this haplotype is 0.09 instead of the predicted 0.04, 
then the assumption of independence has been violated, indi¬ 
cating preferential association of marker allele 1 with the 
disease allele. This association of alleles at linked loci is 
termed linkage disequilibrium. 

Figure 8-10 illustrates how linkage disequilibrium can 
come about. Imagine two marker loci that are both linked to 
the myotonic dystrophy locus on chromosome 19. Marker B 
is closely linked, at less than 1 cM away. Marker A is less 
closely linked, at about 5 cM away. Because each of these 
marker loci has two alleles (denoted 1 and 2), there are four 
possible combinations of marker alleles at the two loci, as 
shown in Figure 8-10. When a new myotonic dystrophy 
mutation first occurs in a population, it can be found on only 
one copy of a chromosome, in this case the one with the AjB^ 
marker combination. As the disease-causing mutation (allele) 
is passed through multiple generations, crossovers will occur 
between it and the two markers. Because the disease locus is 
more closely linked to marker B than marker A, fewer cross¬ 
overs will occur between the disease-causing allele and marker 
B. As a result, the disease-causing allele is found on a B 2 - 
containing chromosome 90% of the time, and it is found on 
an Aj-containing chromosome 72% of the time. The degree 
of linkage disequilibrium is stronger between marker B and 
the disease-causing allele than between marker A and the 
disease-causing allele. Notice also that both the Aj and the B 2 
alleles are still positively associated with the disease-causing 
allele, because each marker allele has a much lower frequency 
(50%) in the population of individuals who lack the disease- 
causing allele (see Fig. 8-10). If enough generations elapse, 
recombination would eventually eliminate the allelic 


associations completely, and the loci would be in linkage 
equilibrium. 

Because linkage disequilibrium is a function of the dis¬ 
tance between loci, it can be used to help infer the order of 
genes on chromosomes. Linkage disequilibrium provides an 
advantage over linkage analysis in that it reflects the action 
of recombinations that have occurred during dozens or hun¬ 
dreds of past generations (i.e., the number of generations that 
have elapsed since the disease-causing mutation first occurred 
in a population). Linkage analysis, in contrast, is limited to 
the recombinations that can be directly observed in only the 
past several generations. Consequently, there are seldom 
enough recombinants in a series of families to map a gene to 
a region smaller than several centimorgans using linkage 
analysis, whereas linkage disequilibrium analysis can some¬ 
times map a gene to an interval of 0.1 cM or less. Flowever, 
linkage disequilibrium can be influenced by evolutionary 
forces, such as natural selection or genetic drift, that have 
acted during the history of a population. For example, some 
loci in the major histocompatibility complex on chromosome 
6 (see Chapter 9) are in disequilibrium, possibly because 
certain allelic combinations confer a selective advantage for 
immunity to some diseases. 

Linkage disequilibrium is the nonrandom 
association of alleles at linked loci. Linkage 
disequilibrium between loci diminishes 
through time as a result of recombination. It 
can be used to infer the order of genes on 
chromosomes. 

Linkage Versus Association in Populations 

The phenomena of linkage and association are sometimes 
confused. Linkage refers to the positions of loci on chromo¬ 
somes. When two loci are linked, specific combinations of 
alleles at these loci will be transmitted together within fami¬ 
lies because they are located together on the same chromo¬ 
some. But, as the HD example cited previously showed, the 
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FIG 8-10 Linkage disequilibrium between the myotonic dystrophy (DM) locus and two linked 
loci, A and B. The DM mutation first arises on the chromosome with the A-iB 2 haplotype. After 
a number of generations have passed, most chromosomes carrying the DM mutation still have 
the A ,62 haplotype, but, as a result of recombination, the DM mutation is also found on other 
haplotypes. Because the haplotype is seen in 70% of DM chromosomes but only 25% of 
normal chromosomes, there is linkage disequilibrium between DM and loci A and B. Because 
locus B is closer to DM, it has greater linkage disequilibrium with DM than does locus A. 
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specific combinations of alleles transmitted together can vary 
from one family to another. Association, on the other hand, 
refers to a statistical relationship between two traits in the 
general population. The two traits occur together in the same 
individual more often than would be expected by chance 
alone. 

As just discussed, the alleles of two linked loci may be 
associated in a population (because of linkage disequilibrium, 
which is a form of association). In this case, a population 
association can lead to mapping of a disease-causing gene. An 


example is given by hereditary hemochromatosis, an autoso¬ 
mal recessive disorder discussed in Chapter 7. One associa¬ 
tion study showed that 78% of patients with hemochromatosis 
had the A3 allele of the human leukocyte antigen (HLA) A 
locus (see Chapter 9 for further discussion of the HLA 
system), whereas only 27% of unaffected subjects (controls) 
had this allele. This strong statistical association prompted 
linkage analyses using HLA polymorphisms and led to the 
mapping of the major hemochromatosis-causing gene to a 
region of several centimorgans on chromosome 6. The large 
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size of this region made it very difficult to pinpoint a specific 
gene. Linkage disequilibrium analysis was subsequently used 
to narrow the region to approximately 250 kb, prompting the 
ready identification of an HLA-like gene (HFE) in which a 
single mutation is responsible for the great majority of cases 
of hereditary hemochromatosis. The association between 
this hemochromatosis-causing gene and the linked HLA-A 
locus is probably the result of a recent hemochromatosis- 
causing mutation that occurred on a chromosome copy 
that contained the HLA-A3 allele. Because the mutation 
occurred only 50 to 100 generations ago, linkage disequilib¬ 
rium is still seen between the HLA-A3 allele and the major 
hemochromatosis-causing mutation. 

Population associations can also result from a causal rela¬ 
tionship between an allele and a disease condition. An 
example of such an association involves ankylosing spondy¬ 
litis, a disease that primarily affects the sacroiliac joint (Fig. 
8-11). Inflammation of the ligaments leads to their ossifica¬ 
tion and eventually to fusion of the joints (ankylosis). The 
HLA-B27 allele is found in about 90% of European Ameri¬ 
cans who have ankylosing spondylitis but in only 5% to 10% 
of the general European American population. Because the 
population incidence of ankylosing spondylitis is quite low 
(<1%), most people who have the HLA-B27 allele do not 
develop ankylosing spondylitis. Elowever, those who have the 
HLA-B27 allele are about 90 times more likely to develop the 
disease than are those who do not have it (i.e., 9% of persons 
with HLA-B27 shown in Table 8-1 have ankylosing spondy¬ 
litis, and only about 0.1% of those without HLA-B27 have the 
disease). Because of this strong association, a test for 



FIG 8-11 Ankylosing spondylitis, caused by ossification of 
discs, joints, and ligaments in the spinal column. Note the 
characteristic posture. (Modified from Mourad LA. Orthope¬ 
dic Disorders. St Louis: Mosby; 1991.) 


HLA-B27 is sometimes included as part of the diagnosis of 
ankylosing spondylitis. Because ankylosing spondylitis is 
thought to be an autoimmune disorder, the association might 
reflect the fact that the HLA system is a key element in the 
body’s immune response (see Chapter 9). 

A population association is also seen between a nucleotide 
variant in the HLA-DQfi locus and type 1 diabetes (see 
Chapter 12). Because autoimmunity is a factor in the etiology 
of type 1 diabetes, there is likely a causative relationship 
between the HLA-DQfi locus and increased susceptibility to 
this form of diabetes. 

Population association refers to the nonran¬ 
dom co-occurrence of factors at the popu¬ 
lation level. Associations are distinguished 
from linkage, which refers to the positions of 
loci on chromosomes. Linkage disequilibrium 
is a special case of association in which there 
is nonrandom association of specific alleles at 
linked loci. 

PHYSICAL MAPPING AND CLONING 

Linkage analysis allows us to determine the relative distances 
between loci, but it does not assign specific chromosome 
locations to markers or disease-causing genes. Physical 
mapping, which has involved a variety of methods, accom¬ 
plishes this goal, and considerable progress has been made in 
developing high-resolution physical mapping approaches. 

Chromosome Morphology 

A simple and direct way of mapping disease-causing genes is 
to show that the disease is consistently associated with a cyto¬ 
genetic abnormality, such as a duplication or deletion. Such 
abnormalities might have no clinical consequences them¬ 
selves (thus serving as a marker), or they might cause the 
disease. Because these approaches are historically the oldest 
of the physical mapping approaches, they are discussed first. 

Deletions 

Karyotypes or array CGEl (see Chapter 6) of patients with a 
genetic disease occasionally reveal deletions of a specific 
region of a chromosome. This provides a strong hint that the 


TABLE 8-1 Association of Ankylosing 
Spondylitis and the HLA-B27 Allele in a 
Hypothetical Population 


ANKYLOSING SPONDYLITIS 
HLA-B27 _PRESENT_ABSENT 

Present 90 1000 

Absent 10 9000 

*This table shows that persons with ankylosing spondylitis 
are much more likely to have the HLA-B27 allele than are 
normal controls. 
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locus causing the disease might lie within the deleted region. 
The extent of a deletion can vary in several patients with the 
same disease. Deletions are compared in many patients to 
define the region that is deleted in all patients, thereby nar¬ 
rowing the location of the gene (Fig. 8-12). Deletion mapping 
has been used, for example, in locating the genes responsible 
for retinoblastoma (see Chapters 4 and 11), Prader-Willi and 
Angelman syndromes (see Chapter 5), and Wilms tumor, a 
childhood kidney tumor that can be caused by mutations on 
chromosome 11. In contrast to the heteromorphisms just 
discussed, deletions of genetic material are the direct cause of 
the genetic disease. 

These deletions affect only one member of the homolo¬ 
gous pair of chromosomes, making the patient heterozygous 
for the deletion. If a region large enough to be microscopi¬ 
cally observable were missing from both chromosomes, it 
would usually produce a lethal condition. 

Translocations 

As discussed in Chapter 6, balanced chromosome transloca¬ 
tions often have no effect on a translocation carrier because 
the individual still has a complete copy of his or her genetic 
material. However, when a translocation happens to interrupt 



FIG 8-12 Localization of a disease-causing gene through 
deletion mapping. A series of overlapping deletions is studied 
in which each deletion produces the disease phenotype. The 
region of overlap of all deletions defines the approximate 
location of the gene. 


a gene, it can produce genetic disease. For example, after 
linkage analysis had mapped the NFl gene approximately to 
the long arm of chromosome 17, a more refined location was 
obtained when two patients were identified—one with a bal¬ 
anced translocation between chromosomes 17 and 22, and 
the other with a balanced translocation between chromo¬ 
somes 17 and 1. The breakpoints of these translocations on 
chromosome 17 were located very close to each other, in the 
same region implicated by linkage analysis. They provided a 
physical starting point for experiments that subsequently led 
to identification of the NPl gene. 

A similar example is provided by translocations observed 
between the X chromosome and autosomes in females with 
Duchenne muscular dystrophy (DMD). Because this is a 
lethal X-linked recessive disorder, affected homozygous 
females are rare. The translocation breakpoint on the X chro¬ 
mosome was found to be in the same location (Xp21) in 
several affected females, suggesting that the translocation 
interrupted the DMD gene. This proved to be the case, and 
these translocations aided considerably in mapping and iden¬ 
tifying the DMD gene. (Although these females also carried 
a normal X chromosome, the normal X was preferentially 
inactivated, leaving only the interrupted X as the active 
chromosome.) 

Dosage Mapping Using Deletions 
and Duplications 

When a deletion occurs on a chromosome, it stands to reason 
that the protein products encoded by genes in the deleted 
region will be present in only half the normal quantity. This 
is the basis of a simple approach known as dosage mapping. 
For example, it was observed that a 50% reduction in the level 
of the enzyme adenylate kinase was consistently associated 
with a deletion on chromosome 9, mapping the adenylate 
kinase gene to this chromosome region. 

Similarly, a duplication of chromosome material should 
be associated with an increase in gene product levels. Because 
three genes are present instead of two, the increase should be 
approximately 50% above normal. This form of dosage 
mapping was used to assign the gene encoding superoxide 
dismutase-1 (SOD-1) to the long arm of chromosome 21. 

A gene can be physically mapped to a chro¬ 
mosome region by associating cytogeneti¬ 
cally observable variations (heteromorphisms, 
translocations, deletions, duplications) with 
gene expression (including the presence of a 
genetic disease). 

Positional Cloning 

Sometimes the gene product responsible for a genetic disease 
is known before the gene itself is identified. This was the case, 
for example, with the jJ-globin polypeptide and sickle cell 
disease. In such cases, one can deduce the DNA sequence 
from the amino acid sequence of the polypeptide; this DNA 
sequence can be used to make a probe in order to locate the 
disease-causing gene. This type of approach, in which the 
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gene product and its function are used to pinpoint the gene, 
is an example of functional cloning (i.e., identifying a spe¬ 
cific gene or gene segment and making copies of it for study 
[see Chapter 3, Box 3-1]). 

More often, however, we have only a linkage result that has 
localized the disease-causing gene to a region near the linked 
marker polymorphism (the locations of these markers have 
been established previously). Because of the limited resolu¬ 
tion of linkage analysis, the region that contains the disease- 
causing locus may be several megabases or larger and can 
easily contain dozens of genes interspersed with noncoding 
DNA (Fig. 8-13). A common approach has been to begin with 
a linked marker and then canvas the region around the 
marker to locate and identify the disease-causing gene itself. 
Because this process begins with an approximate knowledge 
of the gene’s position on a chromosome, it has traditionally 
been termed positional cloning. 

Suppose that we know the approximate location of a 
disease-causing gene, determined through linkage analysis. 
The region that contains the disease-causing gene may be 
defined by the recombination rate between the gene and a 
marker polymorphism (typically one to several centimor- 
gans). Its boundaries are more commonly defined by markers 
that have not undergone observable recombination with the 
gene (see Fig. 8-7). The disease-causing gene could be located 
anywhere in this region, so the region’s DNA sequence must 
be analyzed and evaluated to pinpoint the sequence corre¬ 
sponding to the gene. Until the human genome was fully 
sequenced (Box 8-2), this was a daunting task that could 
require years of work. The finished human genome sequence 
is now available in computerized databases, so investigators 
often scan a region of interest simply by accessing the appro¬ 
priate DNA sequence on a computer. 

As we scan the DNA region containing a disease-causing 
gene, how do we know when we have actually reached the 
gene? Coding DNA (i.e., DNA that encodes proteins) must 
be distinguished from noncoding DNA, and the likely func¬ 
tion of each gene in the region must be determined. Several 
approaches may be used to accomplish these tasks. 

Functional versus Nonfunctional DNA 

Most of our DNA sequence has no known function and is 
unlikely to contribute to disease. Thus, in searching for 
disease-causing alterations, we typically focus on DNA that 


encodes proteins or performs important regulatory functions 
(e.g., enhancer or promoter sequences). Because of their 
functional significance, coding DNA or regulatory DNA 
sequences generally cannot change very much through the 
course of evolution. This means that such DNA sequences 
will be conserved, or similar in base-pair sequence, in many 
different species. In contrast, nonfunctional DNA sequences 
are likely to change more rapidly and to differ substantially 
among species. DNA sequences can be compared among 
species using computer algorithms (see below) to distinguish 
functional (conserved) DNA from nonfunctional (noncon- 
served) DNA. As the genomes of more and more species are 
sequenced, these comparisons are done increasingly by com¬ 
paring whole-genome sequences among species. Such com¬ 
parisons have shown that about 5% of the noncoding human 
genome is highly conserved and is therefore likely to perform 
regulatory functions. 

As discussed in Chapter 3, most CG dinucleotides are 
methylated. However, approximately 60% of human genes 
have unmethylated CG dinucleotides (CG islands) in their 5' 
region. (A lack of methylation in the 5' region of the gene 
probably makes it more accessible to transcription factors 
required for active expression.) The identification of a series 
of CG islands has often been used to pinpoint the locations 
of coding genes. 

The identification of DNA sequences that are 
highly conserved across multiple species, and 
the identification of unmethylated CG islands, 
are both ways to distinguish coding or regula¬ 
tory (functional) DNA from nonfunctional DNA. 

Computer Analysis of DNA Sequence 

Sophisticated computer algorithms can test a DNA sequence 
for patterns that signal a coding gene (e.g., transcription ini¬ 
tiation sites, stop codons, intron-exon boundaries). This 
approach was used, for example, to help identify and charac¬ 
terize one of the adult polycystic kidney disease genes {PKDl; 
see Chapter 4). In addition, these algorithms can often rec¬ 
ognize patterns typical of genes that encode specific classes 
of proteins (e.g., transcription factors, transmembrane 
proteins). 

Computerized databases of known DNA sequences also 
play an important role in gene identification. When studying 



/ 


Linked marker location 


Region implicated by linkage analysis 


FIG 8-13 In a typical linkage analysis, a marker polymorphism is identified that is closely linked 
to the disease-causing gene. The chromosome location of the marker polymorphism has been 
established by previous linkage and physical mapping. The region surrounding the marker poly¬ 
morphism can contain up to several megabases of DNA sequence, and each gene in this region 
is a potential candidate as the disease-causing gene. 
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BOX 8-2 The Human Genome Project 


The Human Genome Project was one of the most widely pub¬ 
licized and ambitious undertakings in the history of biomedical 
research. Initiated in October 1990, this 15-year project had 
three major goals: a genetic marker map, a physical map, and 
the complete 3-billion bp sequence of the human genome. 

The marker map was completed early in the course of the 
project and currently includes many thousands of polymor¬ 
phisms distributed throughout the genome, such as SNPs or 
STBs. On average, a useful polymorphism can be found at 
intervals of much less than 1 cM. Thus, a closely linked marker 
can be found for virtually any disease-causing gene. In addition 
to these polymorphisms, several million SNPs have been iden¬ 
tified throughout the genome. SNPs are single-base variants 
that individually are less polymorphic than an STR polymor¬ 
phism. However, they have a lower mutation rate than these 
polymorphisms, and they are especially amenable to computer¬ 
ized automated processing (e.g., microarrays; see Chapter 3). 
They have thus added even further to the usefulness of the 
human genetic map. 

The second goal, a physical map of known sequence-tagged 
sites (STSs) distributed at 100-kb intervals throughout the 
genome, is also complete. These physical signposts were 
invaluable in positional cloning experiments, where they were 
used to place a series of DNA sequences (e.g., those inserted 
into cloning vectors such as yeast artificial chromosomes 
(YACs), bacterial artificial chromosomes [BACs], or cosmids) 
in relative order. 

The final goal, the complete genome sequence, was pursued 
in both the public and private sectors. The publicly funded 
effort began by establishing a framework of overlapping cloned 
segments of human DNA. These DNA segments, which were 
cloned into vectors such as BACs and bacteriophage PI arti¬ 
ficial chromosomes (PACs), were typically on the order of 100 
to 200 kb in size. Establishing accurate overlaps and chromo¬ 
some locations for these segments presented formidable tech¬ 
nical challenges, and the effort was aided considerably by the 
physical STS map. Each DNA segment was then broken into 
small restriction fragments and sequenced, and the resulting 
data were placed into a publicly accessible database. In con¬ 
trast, the privately funded effort began with much smaller DNA 
segments (several kb in size), cloned into plasmid vectors. 
Each of these small segments was sequenced and then 
searched for overlap in order to assemble the larger DNA 
sequence, making use of the publicly available data. 

In February 2001, both groups announced that they had com¬ 
pleted approximately 90% of the sequence of human 


euchromatic DNA (i.e., the portion of DNA that contains genes). 
A completed, highly accurate sequence, with an error rate 
lower than 1 in 10,000 bp, was unveiled in the spring of 2003, 
exactly 50 years after Watson and Crick first described the 
structure of DNA. 

Completion of this project continues to provide many bene¬ 
fits. Positional cloning, once the bane of genetics laboratories, 
is now much easier because of the existing physical maps and 
DNA sequences. The amount of time required to identify genes 
using approaches like exome and genome sequencing contin¬ 
ues to decrease, and the number of disease-causing genes that 
have been pinpointed in this way is growing each year. The 
discovery of these genes yields many important benefits: 
improved genetic diagnosis, the potential for manufacture of 
gene products by recombinant DNA techniques, and improved 
treatment through more-specific drugs or gene therapy (see 
Chapter 13). 

The same technology used to sequence the human genome 
has been applied to dozens of other organisms: medically sig¬ 
nificant viruses and bacteria, agriculturally important crops such 
as rice and maize, and important experimental organisms such 
as yeast, fruit flies, mice, and rats. Similarities between the 
genes of these organisms and humans have helped us to 
understand the nature of many human genes. 

It is a mistake to think of the completion of the human 
genome sequence as the end of an era of research. The 
genome sequence, while of immense value, is still nothing 
more than a long string of nucleotides. The challenge will be 
to use this vast pool of information to identify genes, under¬ 
stand their regulation and expression, and characterize the 
many complex interactions among genes and environment that 
ultimately give rise to phenotypes. In addition, the original 
public genome sequence represented, for each region of the 
genome, the sequence of only a single individual. (Because of 
the high degree of similarity of genes and their chromosomal 
locations in all humans, this single sequence is still highly 
useful for locating and identifying genes.) Full-scale DNA 
sequencing of many thousands of individuals, which is now 
well under way, will help to inform us about differential sus¬ 
ceptibility to genetic disease. An example is the publicly funded 
1000 Genomes Project, in which genomes from several thou¬ 
sand members of diverse human populations have been 
sequenced. The human genome sequence thus represents the 
beginning, rather than the end, of an era of fruitful and exciting 
biological research. 


a specific region of DNA to find a gene, it is common to 
search for similarity between DNA sequences from the region 
and other DNA sequences in the database. The sequences in 
the database might derive from genes with known function 
or tissue-specific expression patterns. Suppose, for example, 
that we have used linkage analysis to identify a region con¬ 
taining a gene that causes a developmental disorder such as 
a limb malformation. As we evaluate DNA sequences in the 
region, we would look for similarity between a DNA sequence 
from this region and a plausible sequence from the database 
(e.g., a sequence from a gene that encodes a protein involved 


in bone development, such as a fibroblast growth factor). 
Because genes that encode similar protein products usually 
have similar DNA sequences, a match between the sequence 
from our region and a sequence in the database could be a 
vital clue that this particular DNA sequence is actually part 
of the gene that causes the limb malformation. 

The similarity search need not be confined to human 
genes. The complete DNA sequences of many organisms, 
including chimpanzee, chicken, mouse, fruit fly, and yeast, are 
now available in computerized databases. Often, sequence 
similarity is observed with genes of known function in other 
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organisms, such as mouse or even yeast or bacteria. Because 
genes with important protein products tend to be highly 
conserved throughout evolution, the identification of a 
similar gene in another can provide important information 
about the gene’s function in the human. For example, many 
of the genes involved in cell cycle regulation are quite similar 
in yeast and human (e.g., portions of the NPl gene and the 
yeast IRA2 gene). Indeed, approximately one third of human 
disease-causing genes identified to date have similar counter¬ 
parts in yeast. These genes and their products can easily 
be manipulated in experimental organisms, so their well- 
understood functions in these organisms can provide useful 
inferences about their functions in humans. A number of 
important human disease-causing genes have been discov¬ 
ered because similar candidate genes had previously been 
identified in other organisms (e.g., the mouse “small eye” 
gene and aniridia; the mouse “splotch” gene and Waarden- 
burg syndrome; the Drosophila and mouse “patched” genes 
and basal cell nevus syndrome; and the mouse “pink eye” 
gene and oculocutaneous albinism type 2). In addition to 
providing insight about coding genes, interspecies sequence 
comparisons can also reveal highly conserved noncoding 
sequences that contain important regulatory elements. These, 
too, can contribute to disease risk. 

Many computer databases and algorithms are 
now used to infer whether DNA sequences are 
located within genes. The possible function of 
a gene can be inferred by comparing its 
sequence with that of other human or nonhu¬ 
man genes whose functions are known. 

Screen for Mutations in the Sequence 

Once a portion of coding DNA has been isolated, it can be 
examined for disease-causing mutations, usually by direct 
DNA sequencing (see Chapter 3). If a DNA sequence repre¬ 
sents the disease-causing gene, then mutations should be 
found in individuals with the disease, but they should not be 
found in unaffected individuals. To help distinguish disease- 
causing mutations from polymorphisms that vary naturally 
among individuals, it is particularly useful to compare the 
DNA of patients whose disease is caused by a new mutation 
with the DNA of their unaffected parents. While a harmless 
polymorphism will be seen in both the affected offspring and 
the unaffected parents, a mutation that is responsible for the 
disease in the offspring will not be seen in the parents. This 
approach is especially useful for identifying mutations in 
genes for highly penetrant autosomal dominant diseases such 
as NPL 

Another type of mutation that can be tested is a submi- 
croscopic deletion (i.e., a deletion too small to be observable 
under a microscope). Small deletions can be detected using 
array CGH or by high-throughput sequencing (see Chapter 
3). In the latter approach, a deletion is indicated when the 
number of “reads” in a region is substantially decreased. 
Other techniques, including fluorescent in situ hybridization 
(FISH; see Chapter 6) or multiplex ligation-dependent probe 


amplification (MLPA; see Chapter 3) can also be used in some 
cases to detect small deletions. 

Test for Gene Expression 

To help verify that a gene is responsible for a given disease, 
one can test various tissues to determine the ones in which 
the gene is expressed (i.e., transcribed into mRNA). This can 
be done by purifying mRNA from the tissue, placing it on a 
blot, and testing for hybridization with a probe made from 
the gene. This technique, known as Northern blotting (Fig. 
8-14), is conceptually similar to Southern blotting except that 
mRNA, rather than DNA, is being probed. If the gene in 
question causes the disease, the mRNA may be expressed in 
tissues known to be affected by the disease (the same reason¬ 
ing is applied to the analysis of DNA segments obtained from 


o 2 2 

"D 2 2 -9 -9 

< QQ CD U- U- 



FIG 8-14 An example of a Northern blot, showing the hybrid¬ 
ization of a cDNA probe from the EVI2A gene (a gene embed¬ 
ded within an intron of the NFl gene) with mRNA from 
adrenal gland, brain, and fibroblasts. This result indicates that 
EVI2A is expressed in the brain at a much higher level than 
in the other two tissues. (Courtesy Dr. Richard Cawthon, 
University of Utah Health Sciences Center.) 
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tissue-specific cDNA libraries). For example, one would 
expect the phenylalanine hydroxylase gene (mutations in 
which can cause phenylketonuria [PKU]) to be expressed in 
the liver, where this enzyme is known to be synthesized. 

Currently, most testing for gene expression involves the 
use of either microarrays (see Chapter 3) or high-throughput 
sequencing of RNA. In both procedures, mRNA from a tissue 
of interest is first converted to cDNA (see Box 8-3). Microar¬ 
rays containing thousands of oligonucleotide sequences 
(probes) representing genes of interest have been manufac¬ 
tured, and these can be exposed to fluorescently labeled 
cDNA from relevant tissues to determine the tissues in which 
specific genes are expressed. If the labeled cDNA from the 
tissue corresponds to a gene represented on the microarray, 
the cDNA sequence and the probe sequence will undergo 
complementary base pairing, and a fluorescent signal will be 
observed (see Chapter 3). The amount of expression of a 
given gene in a tissue is compared to the expression level in 
a known control sample, yielding a relative expression level. 
A limitation of microarray gene expression analysis is that 
only the transcripts that correspond to the probes on the 
microarray can be assessed. Novel or previously unknown 
transcripts are missed. 

High-throughput RNA sequencing (or RNA-seq, Fig. 
8-15) provides a more quantitative estimate of gene expres¬ 
sion by sequencing cDNA segments, matching them with a 
reference DNA sequence, and counting the number of times 
each segment is read. Genes that are highly expressed in a 
tissue (i.e., genes that produce a large number of mRNA 
transcripts) will produce a larger number of sequence reads. 
A gene that is highly expressed in an appropriate tissue 
becomes a good candidate for disease causation (e.g., the gene 
that encodes phenylalanine hydroxylase would have a high 
level of expression in liver tissue). 

Another form of gene expression testing involves inserting 
the normal version of the DNA sequence into a defective cell 
from an affected person (or animal model), using recombi¬ 
nant DNA techniques. If the normal sequence corrects the 
defect, then it is very likely to represent the gene of interest. 
This approach has been used, for example, to show that muta¬ 
tions in the CFTR gene can cause cystic fibrosis. 



In determining whether a gene contributes 
to the cause of a disease, a critical test is to 
screen the gene for mutations that are 
present in affected persons but absent in 
unaffected controls. Further evidence that a 
gene helps to cause a specific disease is to 
show that mRNA corresponding to that gene 
is expressed in tissues associated with or 
affected by the disease. Gene expression levels 
can be assessed using microarrays or RNA 
sequencing. 


Candidate Genes 

The gene hunting process can be expedited considerably if a 
candidate gene is available. As the name implies, this is a gene 



1 

ATCACAGTGGGACTCCATAAATTTTTTTCT 

CGAAGGACCAGCAGAAACGAGAGAAAAAAA 

GGACAGAGTCCCCAGCGGGCTGAAGAGGGG 

ATGAAACATTAAAGTCAAACAATATATGAA 

1 


Exonic reads 
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FIG 8-15 Expression analysis using RNA sequencing (RNA- 
seq). mRNA is purified from cells of a specific type and either 
analyzed directly or reverse transcribed to complementary 
DNA (cDNA), as shown in the figure. High-throughput 
sequencing technology (see Chapter 3, Figure 3-27) is used 
to generate sequence reads of approximately 100 bp in 
length, corresponding to exons in the mature mRNA. The 
number of reads for a specific exon indicates the mRNA 
expression level, as shown in the graph at the bottom of the 
figure. 


whose known protein product makes it a likely candidate for 
the disease in question. For example, the various collagen 
genes were considered to be reasonable candidates for Marfan 
syndrome because collagen is an important component of 
connective tissue. However, linkage analysis using collagen 
gene markers in Marfan syndrome families consistently 
yielded negative results. Another candidate gene emerged 
when the gene that encodes fibrillin-1 (FBNl) was identified 
on chromosome 15. Fibrillin, as discussed in Chapter 4, is also 
a connective tissue component. Linkage analysis had localized 
the Marfan syndrome gene to chromosome 15, so FBNl 
became an even stronger candidate. Analysis of FBNl muta¬ 
tions showed that they were consistently associated with 
Marfan syndrome, confirming these mutations as a cause of 
the disease. This combination of linkage analysis to identify 
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the region containing a gene followed by a search of the 
region for plausible candidate genes is sometimes called the 

positional candidate approach. 



Candidate genes are those whose characteris¬ 
tics (e.g., protein product) suggest that they 
may be responsible for a genetic disease. The 
analysis of candidate genes in a region known 
to contain the disease-causing gene is termed 
the positional candidate approach. 


GENE MAPPING BY ASSOCIATION: 
GENOME-WIDE ASSOCIATION STUDIES 

Technological developments, including microarrays (see 
Chapter 3), have enabled investigators to test for associations 
between disease phenotypes in populations and millions of 
marker loci distributed throughout the genome. These 
genome-wide association studies (GWAS) typically involve 
examining SNPs, using a microarray, in large numbers of 
affected cases. Microarrays are also used to assess copy 
number variants (CNVs, see Chapter 3), which can vary con¬ 
siderably among individuals. The allele frequency of each 
SNP among the cases is compared with the allele frequency 
of the same SNP in a sample of unaffected persons (controls) 
(Fig. 8-16). If a statistically significant difference is seen in the 
SNP frequencies in cases versus controls, then the SNP may 



DNA from cases 
and controls 
analyzed using 
SNP microarrays 


FIG 8-16 Genome-wide association study, in which the fre¬ 
quencies of single nucleotide polymorphisms (SNPs) or copy 
number variants (CNVs) are compared in samples of cases 
and unaffected controls. A SNP microarray is analyzed for 
each case and each control to provide estimates of the allele 
frequencies of each SNP in the case and control samples. 
SNPs whose allele frequencies differ significantly between 
the case and control samples may be located in or near a 
disease-causing gene. 


be located in or very near a gene or regulatory element that 
contributes to disease susceptibility. The SNP itself might 
cause disease, or it might be in linkage disequilibrium with a 
nearby variant that causes the disease. When one million 
SNPs are typed, each SNP is located, on average, only 3 kb 
from the next SNP, making it very likely that the SNP will be 
located close to a disease-causing variant. 

GWAS have been especially useful in identifying variants 
that contribute to common diseases, such as diabetes, cancers, 
and heart disease (see Chapter 12). Because such diseases are 
typically caused by multiple disease-causing loci, as well as 
nongenetic factors, traditional linkage analysis has often been 
ineffective in detecting causal loci. An advantage of GWAS is 
that no assumptions about the biology of the disease are 
made to choose which genes to study: variants near every 
gene are tested. Indeed, the results of a GWAS often point to 
new biological pathways that had not been previously sus¬ 
pected to play a role in the disease being studied. In addition, 
it is not necessary to collect family data to detect associations 
in populations (although such data can be useful). Instead, 
unrelated cases and controls, which are easier to locate and 
sample than whole families, are typically used in GWAS. 

Association studies should be interpreted cautiously, 
however, because many factors can produce spurious associa¬ 
tions between a disease and a potential risk factor. An example 
is ethnic stratification in a population. Some diseases are 
more common in certain ethnic groups, and allele frequen¬ 
cies can also differ among these groups because of their evo¬ 
lutionary histories. Thus, if one compares disease cases and 
controls without proper matching for ethnicity, a false asso¬ 
ciation, due simply to ethnic differences between the two 
groups, could be found. For example, type 2 diabetes (see 
Chapter 12) has been studied extensively among the Pima 
Native American population, where the disease is much more 
common than among European Americans. It was observed 
that the absence of a haplotype encoding human immuno¬ 
globulin G (labeled Gm3) was strongly associated with type 
2 diabetes among the Pima. This suggested initially that the 
absence of Gm3 might be involved in causing type 2 diabetes. 
However, further analysis revealed that the proportion of 
European ancestry varied substantially among members of 
the Pima population and that the Gm3 frequency also varied 
with the degree of European ancestry: Gm3 is absent in Pimas 
with no European ancestry but has a frequency of 65% among 
Europeans. Because type 2 diabetes is less common among 
Europeans, the apparent association between type 2 diabetes 
and the absence of Gm3 was most likely a consequence of the 
level of European mixture. Once the degree of European 
ancestry in the study subjects was taken into account, there 
was no evidence of an association. 

Other factors that can produce false associations include 
imprecise definition of the disease state, inadequate sample 
sizes, and improper matching of cases and controls for vari¬ 
ables such as age and sex. The inability to replicate an associa¬ 
tion in multiple study populations is an indication that the 
association may be invalid. In addition, because a typical 
GWAS compares a million or more markers in cases and 
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controls, a small fraction of markers will appear to be associ¬ 
ated with the disease just by chance. Statistical procedures can 
be used to take this into account and correct for it. 

Although GWAS typically incorporate one to several 
million SNPs distributed across the genome, human popula¬ 
tions can contain several times this number of SNPs. How 
can we be sure that the subset of SNPs used in a GWAS 
adequately accounts for all of the SNPs in the genome? Inves¬ 
tigators have applied the concept of linkage disequilibrium to 
address this issue. Suppose, for example, that SNP nucleotide 
C is known to be in very strong linkage disequilibrium with 
a nearby SNP nucleotide, T. This means that whenever a 
person has allele C, he or she almost always has allele T as 
weU (i.e., the haplotype is C/T). Thus it is not necessary to 
type both SNPs in cases and controls in a study: those who 
have C at the first SNP are also assumed to have T at the 
second SNP. By identifying sets of SNPs that are in strong 
disequilibrium, only one member of the set has to be typed 
(Fig. 8-17). This can substantially reduce the cost of a GWAS. 
A large-scale effort to identify sets of SNPs in linkage disequi¬ 
librium with one another, the International Haplotype Map 
Project (HapMap), has established linkage disequilibrium 
patterns for millions of SNPs in the genomes of African, 
Asian, and European populations. This has in turn made it 
possible for researchers to focus gene-hnding efforts on a 
reduced number of highly informative SNP markers in any 
region of the human genome. 

Most GWAS to date have been based on microarray data 
(SNPs or CNVs compared in cases and controls). This limits 
the study to the variants present on the microarray, and these 
variants typically have allele frequencies greater than 5% in 
populations. Thus, rare variants, which may contribute sub¬ 
stantially to disease causation, have typically been omitted 
from GWAS. Increasingly, however, DNA sequence data (see 
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CAG...TCTGA...CCG 

CAG...TCTGA...CCG 

TAG...TCGGA...CCC 

TAG...TCGGA...CCC 

CAG...TCTGA...CCG 

TAG...TCGGA...CCC 

FIG 8-17 DNA sequences from the same chromosome loca¬ 
tion have been examined in six individuals (one chromosome 
copy in each individual). Three polymorphic single nucleotide 
polymorphisms (SNPs; arrows) in this sequence are shown 
in red. The other nucleotide bases do not vary among indi¬ 
viduals. Because of linkage disequilibrium, alleles C, T, and G 
of the three SNPs occur together on some chromosome 
copies, and alleles T, G, and C occur together on other chro¬ 
mosome copies. Thus, it is necessary to type only one of the 
SNPs in order to know which allele an individual possesses 
at the other two SNPs. 


text below), generated for each case and control, are being 
used in association studies so that all variation, both common 
and rare, can be taken into account. 

Genome-wide association studies (GWAS) test 
for association, or linkage disequilibrium, 
between a disease and a marker (or several 
markers) by testing thousands to millions of 
markers across the genome. Typically this is 
accomplished with microarray analysis of 
disease cases and unaffected controls. As in all 
case-control studies, considerable care must 
be taken to avoid spurious results by closely 
matching cases and controls. 

EXOME AND WHOLE-GENOME SEQUENCING 

During the past two decades the number of disease-related 
genes discovered by positional cloning and candidate 
gene approaches steadily climbed. However, it became 
increasingly clear that gene identification for most mendelian 
conditions for which the causal gene remained unknown was 
intractable using these approaches. Newer technologies for 
DNA sequencing, termed massive parallel sequencing or 
next-generation sequencing, dramatically reduced costs com¬ 
pared to older methods such as Sanger sequencing (see 
Chapter 3). Development of methods for coupling targeted 
capture of thousands of regions of the human genome cor¬ 
responding to every gene with massively parallel DNA 
sequencing made it possible to cost-effectively determine 
nearly all the coding variation in an individual human 
genome, which sums to less than 2% of the genome, in a 
single experiment. This process, in which exons are targeted 
and then sequenced, is called exome sequencing. This tech¬ 
nique has become a powerful new approach for identifying 
genes underlying mendelian conditions, particularly in cir¬ 
cumstances where conventional approaches have failed. 

The basic steps required for exome sequencing are shown 
in Figure 8-18. DNA is randomly sheared, a genomic library 
of these fragments is constructed (Box 8-3), and the frag¬ 
ments are flanked by adaptors. Next, the library is enriched 
for sequences corresponding to exons by hybridization 
capture. The fragments are hybridized to biotinylated DNA 
or RNA baits in the presence of blocking oligonucleotides 
that are complementary to the adaptors. Recovery of the 
hybridized fragments by biotin-streptavidin-based pulldown 
is followed by PGR amplification and massively parallel 
sequencing of the enriched, amplihed library. Computer 
analysis of the sequencing data can then identify virtually all 
of the variants in nearly all of the protein-coding genes of an 
individual. 

On average, exome sequencing identifies about 20,000- 
25,000 single nucleotide variants per person. More than 95% 
of these variants have been observed previously and are iden¬ 
tified as polymorphisms in human populations (i.e., the fre¬ 
quency of the less common allele is greater than one percent). 
So in an individual with a mendelian condition, these are less 
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FIG 8-18 Exome sequencing. Genomic DNA is randomly 
sheared to create a library of DNA fragments that are flanked 
by adapter sequences (not shown). The library is enriched for 
sequences corresponding to exons (dark blue fragments) by 
hybridization capture. In the capture procedure, also known 
as "pull down," the fragments are hybridized to biotinylated 
DNA or RNA baits (orange fragments) in the presence of 
blocking oligonucleotides that are complementary to the 
adaptors (not shown). The hybridized DNA fragments are 
amplified by PCR and rapidly sequenced using high-throughput 
techniques (see Chapter 3). The end result is the DNA 
sequence of the great majority of the exons in the genome. 
(Modified from Bamshad MJ, Ng SB, Bigham AW, et al. 
Exome sequencing as a tool for Mendelian disease gene 
discovery. Nat Rev Genet. 2011 ;12(11 ):745-755.) 


likely to be disease-causing. This still leaves several hundred 
variants per person considered as candidates (i.e., any one of 
these variants may be the one that causes a mendelian condi¬ 
tion). There are several ways to further reduce the number of 
candidate variants and eventually identify an underlying 
causal gene. Foremost is prioritizing variants depending on 
whether they are predicted to have a functional consequence 
such as causing a stop codon, an amino acid substitution, or 
a frameshift mutation. In addition, as discussed previously, a 
variant site has higher priority if it is highly conserved across 
species because conservation implies functional significance. 
A popular subsequent step is to compare the candidate vari¬ 
ants found in an affected person to the candidate variants 
found in other persons with the same condition. Finding 
multiple novel (i.e., never observed before in humans) or 
rare variants in the same gene in multiple persons with the 
same condition suggests that the gene is causal. The use of 
pedigree information can substantially improve the ability to 
find the causal gene. Indeed, exome sequencing of as few as 
two individuals can pinpoint the causal gene for a mendelian 
condition. 

Exome sequencing of both parents and their child (i.e., 
parent-child trios) is a highly effective approach for identify¬ 
ing de novo coding mutations. Finding multiple de novo 
events in the same gene among individuals with the same 
condition is extremely unlikely unless the gene plays a role in 
causing the condition. Sequencing of parent-child trios has 


BOX 8-3 Genomic and cDNA Libraries 


DNA libraries play important roles in gene identification. A 
DNA library is much like an ordinary library, except that the 
library is composed of pieces of DNA rather than books. 

The most general type, a genomic library, consists of frag¬ 
ments of DNA that are the result of a restriction digestion or 
mechanical shearing of whole genomic DNA. In restriction 
digestion, the DNA is partially digested, so that some recog¬ 
nition sites are cleaved and others are not. This produces 
fragments that will overlap with one another. These frag¬ 
ments can then be used, for example, in exome or whole- 
genome sequencing. A genomic library contains all of the 
human genome: introns, exons, enhancers, promoters, and 
the vast stretches of noncoding DNA that separate genes. 

A cDNA (complementary DNA) library contains only the 
DNA that corresponds to exons. It is obtained by purifying 
mRNA from a specific tissue, such as liver or skeletal muscle, 
and then exposing the mRNA to an enzyme called reverse 
transcriptase. This enzyme copies the mRNA into a cDNA 
sequence that is complementary to the mRNA sequence 
(hence the term complementary DNA). cDNA libraries can be 
used to analyze tissue-specific expression of genes, as 
described in the text. 

Yet another type of DNA library is the chromosome- 
specific library. Chromosomes are sorted by a method 
called flow cytometry, which separates chromosomes 
according to the fraction of AT base pairs in each one. The 
result is a library that consists mostly of DNA from only one 
chromosome. For example, after the gene for Huntington 
disease was mapped to a region on the short arm of chromo¬ 
some 4, a library specific for that chromosome was used to 
refine the location of the gene. 


been used to find de novo mutations responsible for both rare 
mendelian conditions such as Schinzel-Giedion syndrome 
and for genetically heterogeneous disorders such as intellec¬ 
tual disability, schizophrenia, and autism. This study design 
may be particularly applicable to gene discovery in conditions 
for which most cases are sporadic (i.e., the parents are unaf¬ 
fected), and when a dominant mode of inheritance is sus¬ 
pected (e.g., when there are few instances of parent-to-child 
transmission), and/or when substantial locus heterogeneity is 
expected. 

Exome sequencing has revolutionized the discovery of 
genes underlying mendelian conditions. Nevertheless, exome 
sequencing has its limitations. For example, it is challenging 
to accurately infer the presence of small deletions and inser¬ 
tions as well as copy number variants (CNVs). Many of these 
limitations can be overcome by whole-genome sequencing. 
As the cost of whole-genome sequencing continues to drop, 
it will likely become the preferred strategy for gene discovery. 
The power of gene discovery strategies based on exome 
and whole-genome sequencing means that it is likely that in 
the not too distant future, the genetic basis of virtually all 
mendelian conditions will be known (see current examples 
in Table 8-2), and emphasis will shift further toward 
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TABLE 8-2 Examples of Well-Known Mendelian Disease-Causing Genes 

DISEASE 

CHROMOSOME LOCATION 

GENE PRODUCT 

ai-Antitrypsin deficiency 

14q32.1 

Serine protease inhibitor 

a-Thalassemia 

16p13.3 

a-Globin component of hemoglobin 

P-Thalassemia 

11p15.5 

P-Globin component of hemoglobin 

Achondroplasia 

4pl6.3 

Fibroblast growth factor receptor 3 

Albinism, oculocutaneous (type 1) 

11q14-q21 

Tyrosinase 

Albinism, oculocutaneous (type 2) 

15q11-q12 

Tyrosine transporter 

Alzheimer disease* (familial) 

14q24.3 

Presenilin 1 


1q31-q42 

Presenilin 2 


21 q21 

P-Amyloid precursor protein 

Amyotrophic lateral sclerosis* (familial) 

21 q22.1 

Superoxide dismutase 1 


9p21.2 

C90RF72 

Angelman syndrome 

15q11-q13 

Ubiquitin-protein ligase E3A 

Ataxia-telangiectasia 

11q22.3 

Cell cycle control protein 

Bloom syndrome 

15q26.1 

RecQ helicase 

Breast cancer (familial) 

17q21 

BRCA1 tumor suppressor/DNA repair protein 


13q12.3 

BRCA2 tumor suppressor/DNA repair protein 


22 q12.1 

CHEK2 DNA repair protein 

Li-Fraumeni syndrome 

17p13.1 

P53 tumor suppressor 


22 q12.1 

CHEK2 DNA repair protein 

Cystic fibrosis 

7q31.2 

Cystic fibrosis transmembrane regulator (CFTR) 

Duchenne/Becker muscular dystrophy 

Xp21.2 

Dystrophin 

Ehlers-Danlos syndrome* 

2q31 

Collagen (COL3A1); there are numerous types of this 
disorder, most of which are produced by mutations in 
collagen genes 

Fragile X syndrome 

Xq27.3 

FMR1 RNA-binding protein 

Friedreich ataxia 

9q13 

Frataxin mitochondrial protein 

Galactosemia 

9pl3 

Galactose-1-phosphate-uridyltransferase 

Hemochromatosis (adult)* 

6p21.3 

Transferrin receptor binding protein 

Hemophilia A 

Xq28 

Clotting factor VIII 

Hemophilia B 

Xq27 

Clotting factor IX 

Hereditary nonpolyposis colorectal 

3p21.3 

MLH1 DNA mismatch repair protein 

cancer 

2 p22-p21 

MSH2 DNA mismatch repair protein 


2q31-q33 

PMS1 DNA mismatch repair protein 


7p22 

PMS2 DNA mismatch repair protein 


2p16 

MSH6 DNA mismatch repair protein 


14q24.3 

MLH3 DNA mismatch repair protein 

Huntington disease 

4p16.3 

Huntingtin 

Hypercholesterolemia (familial) 

19p13.2 

LDL receptor 

LQT1* 

11p15.5 

KCNQ1 cardiac K* channel a subunit 

LQT2 

7q35-q36 

KCNH2 cardiac K* channel 

LOTS 

3p21 

SCN5A cardiac Na* channel 

LQT4 

4q25-q27 

Ankyrin-B 

LQT5 

21 q22 

KCNE1 cardiac K* channel |3 subunit 

LQT6 

21 q22 

KCNE2 cardiac K* channel 

Marfan syndrome type 1 

15q21.1 

Fibriliin-1 

Marfan syndrome type 2 

3p22 

TGF-p receptor type 2 

Melanoma (familial)* 

9p21 

Cyclin-dependent kinase inhibitor tumor suppressor 


12q14 

Cyclin-dependent kinase 4 

Myotonic dystrophy type 1 

19q13.2-q13.3 

Protein kinase 

Myotonic dystrophy type 2 

3q13.3-q24 

Zinc finger protein 

Neurofibromatosis type 1 

17q11.2 

Neurofibromin tumor suppressor 

Neurofibromatosis type 2 

22 q12.2 

Merlin (schwannomin) tumor suppressor 

Parkinson disease (familial) 

4q21 

a-Synuclein 

Parkinson disease (autosomal 
recessive early-onset) 

6q25.2-q27 

Parkin 

Phenylketonuria 

12q24.1 

Phenylalanine hydroxylase 


Continued 
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TABLE 8-2 Examples of Well-Known Mendelian Disease-Causing Genes—cont'd 

DISEASE 

CHROMOSOME LOCATION 

GENE PRODUCT 

Polycystic kidney disease 

16p13.3-p13.12 

Polycystin-1 membrane protein 


4q21-q23 

Polycystin-2 membrane protein 


6p21-p12 

Fibrocystin receptor-like protein 

Polyposis coli (familial) 

5q21-22 

APC tumor suppressor 

Retinitis pigmentosa* (nearly 60 

3q21-q24 

Rhodopsin 

genes Identified to date for non- 

11q13 

Rod outer segment membrane protein-1 

syndromic retinitis pigmentosa: 

6p21.1 

Peripherin/RDS 

representative examples shown 

4pl6.3 

Retinal rod photoreceptor cGMP phosphodiesterase, 

here) 


P subunit 


Xp21.1 

Retinitis pigmentosa GTPase regulator 

Retinoblastoma 

13q14.1-q14.2 

pRB tumor suppressor 

Rett syndrome 

Xq28 

Methyl CpG-binding protein 2 

Sickle cell disease 

11p15.5 

P-Globin component of hemoglobin 

Tay-Sachs disease 

15q23-q24 

Hexosaminidase A 

Tuberous sclerosis type 1* 

9q34 

Hamartin tumor suppressor 

Tuberous sclerosis type 2 

16p13.3 

Tuberin tumor suppressor 

Wilson disease 

13q14.3-q21.1 

Copper transporting ATPase 

von Willebrand disease 

12p13.3 

von Willebrand clotting factor 


APC, Adenomatosis polyposis coli; ATPase, adenosine triphosphatase; GMP, guanosine monophosphate: GTPase, guanosine triphosphatase: 
LOT, long QT [syndrome]; TGF, transforming growth factor. 

*Additional disease-causing loci have been mapped and/or cloned. 


understanding disease mechanisms to develop better thera¬ 
peutic strategies. 

Although the gene identihcation examples used in this 
chapter have focused on mendelian conditions, these same 
approaches are being used to identify genes that contribute 
to the cause of common, complex diseases such as diabetes, 
hypertension, and heart disease. Because these diseases are 


influenced by multiple genes (as discussed in Chapter 12), 
gene identification tends to be especially challenging. Never¬ 
theless, the techniques discussed in this chapter are now being 
applied with considerable success in pinpointing genes that 
help to cause the common diseases that affect the majority of 
the human population. 


I STUDY QUESTIONS 


1. In Figure 8-19, a pedigree for an autosomal dominant 
disease, each family member has been typed for a four- 
allele STR marker, as shown in the autoradiogram below 
the pedigree. Determine linkage phase for the disease and 
marker locus in the affected male in generation II. Based 
on the meioses that produced the offspring in generation 
III, what is the recombination frequency for the marker 
and the disease locus? 

2. In the Huntington disease pedigree in Figure 8-20, the 
family has been typed for two two-allele markers, A and 
B. The genotypes for each marker are shown below the 
symbol for each family member, with the genotype for 
marker A above the genotype for marker B. Under the 
hypothesis that 0 = 0.0, what is the LOD score for linkage 
between each marker locus and the Huntington disease 
locus? 

3. Interpret the following table of LOD scores and recombi¬ 
nation frequencies (9): 


e 

0.0 

0.05 

0.1 

0.2 

0.3 

0.4 

0.5 

LOD 

—oo 

1.7 

3.5 

2.8 

2.2 

1.1 

0.0 


4. Two pedigrees for an autosomal dominant disease are 
shown in Figure 8-21. The families have been typed for a 
srx-allele STR. Based on the data in these families, what is 
the recombination frequency between the marker locus 
and the disease locus? What is the LOD score for linkage 
between the marker and disease loci under the hypothesis 
that 0 = 0.0? 

5. Consider the pedigree in Figure 8-22, in which an autoso¬ 
mal recessive gene is being transmitted and each family 
member has been typed for a five-allele STR polymor¬ 
phism. Carrier status in each individual has been estab¬ 
lished by an independent enzymatic assay. What is the 
recombination frequency between the STR and the disease 
locus? 

6. An autosomal dominant disease is being transmitted 
in the pedigree in Figure 8-23. A two-allele marker has 
been typed for each family member. What is the recombi¬ 
nation frequency for the marker and the disease locus? 
What is the LOD score for a recombination frequency of 
0.0? What is the LOD score for a recombination frequency 
of 0.1? 
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FIG 8-20 Pedigree for study question 2. 
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FIG 8-21 Pedigrees for study question 4. 
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4,5 1,3 3,5 1,4 3,5 

FIG 8-22 Pedigree for study question 5. 
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FIG 8-23 Pedigree for study question 6. 



FIG 8-24 Pedigree for study question 7. 


The family shown in the pedigree in Figure 8-24 has pre¬ 
sented for genetic diagnosis. The disease in question is 
inherited in autosomal dominant fashion. The family 
members have been typed for a closely linked two-allele 
marker locus. What can you tell the family about the risks 
for their offspring? 

Distinguish the differences among the concepts of 
synteny (syntenic), linkage, linkage disequilibrium, and 
association. 

A published study showed no linkage disequilibrium 
between NFl and a very closely linked marker locus. 
Explain this, keeping in mind that the NFl gene has a high 
rate of new mutations. 
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CHAPTER 


Immunogenetics 


Each day, our bodies are confronted with a formidable series 
of invaders: viruses, bacteria, and many other disease-causing 
organisms whose goal is to overcome our natural defenses. 
These defenses, known collectively as the immune system, 
consist of a diverse collection of trillions of cells. The immune 
system must be able to cope with a multitude of invading 
microorganisms, and it must be able to distinguish “self” 
from “nonself” with a high degree of accuracy. 

As one might expect, the genetic basis of the immune 
system is complex. The study of the genetics of the immune 
system, known as immunogenetics, has benefited enor¬ 
mously from new developments in gene identification and 
characterization. Most of the techniques discussed in earlier 
chapters (e.g., linkage analysis, genome-wide association 
studies, and high-throughput DNA sequencing) have been 
used to study genes responsible for the immune response. 
Many new genes have been discovered, and their functions 
and interactions have been studied intensely. This chapter 
provides a brief review of basic immunology and discusses 
the genes underlying the body’s capacity to defend against a 
highly diverse array of pathogens. Aspects of autoimmune 
disease are examined, and some of the major immunodefi¬ 
ciency disorders are discussed. 

THE IMMUNE RESPONSE: BASIC CONCEPTS 
The Innate Immune System 

When a pathogenic microorganism is encountered, the body’s 
first line of defense includes phagocytes (a type of cell that 
engulfs and destroys the microorganism) and the comple¬ 
ment system. The complement proteins can destroy microbes 
directly by perforating their cell membranes, and they can 
also attract phagocytes and other immune system agents to 
microbes by coating the microbial surface (it is because of 
this assisting role that the term complement originated). 
Natural killer cells, a specific type of lymphocyte, can 
respond to certain viral infections and some tumor cells. 
Phagocytes, complement, and natural killer cells are all part 
of the innate immune system, which is capable of respond¬ 
ing very rapidly to pathogens. 

The innate immune system is activated by general features 
that are detected in pathogens but are not found in the host. 
For example, gram-negative bacteria produce lipopolysac- 
charides, and gram-positive bacteria produce peptidoglycans. 
Some bacteria have a high percentage of unmethylated CG 
sequences, and some viruses produce double-stranded RNA. 
These distinctive features of pathogenic organisms can be 
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detected by receptor molecules located on the surfaces of 
innate immune system cells. An important example is the 
Toll-like family of receptors, named after a cell-surface recep¬ 
tor, Toll, that was first described in fruit flies. The genes 
that encode the human and fruit-fly versions of Toll-like 
receptors have a remarkable similarity to one another, attest¬ 
ing to their importance in maintaining an innate immune 
response in a wide variety of organisms. Indeed, all multi¬ 
cellular organisms are thought to possess innate immune 
systems. 

The innate immune system, which includes 
some phagocytes, natural killer cells, and the 
complement system, is an early part of the 
immune response and recognizes general fea¬ 
tures of invading microorganisms. 

The Adaptive Immune System 

Although the innate immune system typically helps to hold 
an infection in check in its early phases, it is sometimes inca¬ 
pable of overcoming the infection. This becomes the task of 
a more specialized component of the immune response, the 
adaptive immune system. As its name suggests, this part of 
the immune system is capable of changing, or adapting to 
features of the invading microorganism in order to mount a 
more specific and more effective immune response. The 
adaptive immune system is a more recent evolutionary devel¬ 
opment than the innate immune system and is found only in 
vertebrates. 

Key components of the adaptive immune response (Fig. 
9-1) include T lymphocytes (or T cells) and B lymphocytes 
(or B cells). These cells develop in the body’s primary lym¬ 
phoid organs (bone marrow for B cells and the thymus for T 
cells). In the thymus, developing T cells are exposed to a wide 
variety of the body’s peptides. Those that can recognize and 
tolerate the body’s own peptides are selected, and those that 
would attack the body’s peptides are eliminated. The B and 
T cells progress to secondary lymphoid tissues, such as the 
lymph nodes, spleen, and tonsils, where they encounter 
disease-causing microorganisms. Mature B lymphocytes 
secrete circulating antibodies, which combat infections. The 
B lymphocyte component of the immune system is some¬ 
times called the humoral immune system because it pro¬ 
duces antibodies that circulate in the blood stream. Helper T 
lymphocytes stimulate B lymphocytes and other types of T 
lymphocytes to respond to infections more effectively, and 
cytotoxic T lymphocytes can directly kill infected cells. 
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Production of T and B cells 
with many possible receptors 
for antigen 


Selection, proliferation, and differentiation of 
individual T and B cells with receptors for 
a specific antigen 


Thymus 



T regulatory 
cell 


Cytotoxic T cell 


VDJ recombination 
in B cells and T cells 



Memory T cell 



-CELLULAR 
IMMUNITY 


Memory B cell 


Plasma cell 


Antibody 


-HUMORAL 
IMMUNITY 


Central Secondary 

lymphoid lymphoid 

organs organs 

FIG 9-1 Overview of the adaptive immune response. Lymphoid stem cells from the bone 
marrow migrate to the central lymphoid organs, where they undergo cellular division and dif¬ 
ferentiation, producing either T cells (thymus) or B cells (bone marrow). At this stage, the B and 
T cells have not yet encountered antigens, but they have developed considerable diversity in 
their cell-surface receptors as a result of VDJ recombination and junctional diversity (see text). 
The cells enter the circulation and migrate to the secondary lymphoid organs (e.g., the spleen 
and lymph nodes), where they will encounter foreign antigens that are usually processed by 
antigen-presenting cells (APCs) for presentation to helper T (Th) cells. Only a small subset of T 
and B cells have receptors capable of binding to a specific foreign antigen, and this subset is 
selected for further development and differentiation. Somatic hypermutation (see text) occurs 
in B cells at this stage and results in further receptor diversity and the ability to bind the foreign 
antigen with greater affinity. The end result of this process is a humoral (B cell) and/or cellular 
(T cell) response to a foreign antigen. The T-cell response includes cytotoxic T cells that can kill 
infected cells, regulatory T cells that help to control the immune response, and memory T cells 
that enable a rapid response if the antigen is encountered later in life. Humoral immunity results 
in a population of mature B cells (plasma cells) that secrete antibodies into the circulation and a 
population of memory B cells. 


Because of this direct interaction with infected cells, the T-cell 
component of the immune system is sometimes called the 
cellular immune system. It is estimated that the body con¬ 
tains several trillion B and T cells. 

B lymphocytes are a component of the adap¬ 
tive immune system; they produce circulating 
antibodies in response to infection. T lympho¬ 
cytes, another component of the adaptive 
immune system, interact directly with infected 
cells to kill these cells, and they aid in the B-cell 
response. 


The B-Cell Response: Humoral Immune System 

A major element of the adaptive immune response begins 
when specialized types of phagocytes, which are part of the 
innate immune system, engulf invading microbes and then 
present peptides derived from these microbes on their cell 
surfaces. These cells, which include macrophages and den¬ 
dritic cells, are termed antigen-presenting cells (APCs). B 
cells are also capable of engulfing microbes and presenting 
foreign peptides on their cell surfaces. 

The APCs alert the adaptive immune system to the pres¬ 
ence of pathogens in two ways. First, the foreign peptide is 
transported to the surface of the APC by a class II major 
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histocompatibility complex (MHC) molecule, which carries 
the foreign peptide in a specialized groove (Fig. 9-2). This 
complex, which projects into the extracellular environment, 
is recognized by T lymphocytes, which have receptors on their 
surfaces that are capable of binding to the MHC-peptide 
complex. In addition, the APCs, upon encountering a patho¬ 
gen, display costimulatory molecules on their cell surfaces 
as a signal that foreign pathogens have been encountered (Fig. 
9-3). Binding to the MHC-peptide complex stimulates the 
helper T lymphocyte to secrete cytokines, which are signaling 
proteins that mediate communication between cells. In 
particular, these cytokines help to stimulate the subset of B 
lymphocytes whose cell surface receptors, termed immuno¬ 
globulins, can bind to the invading microorganism’s peptides 
(see Fig. 9-3). The immunoglobulin’s capacity to bind a spe¬ 
cific foreign peptide (i.e., its affinity for the peptide) is deter¬ 
mined by its shape and other characteristics. 

In the humoral immune response, foreign par¬ 
ticles are displayed in conjunction with class II 
MHC molecules by antigen-presenting cells. 
These displayed molecules are recognized by 
helper T cells, which then stimulate prolifera¬ 
tion of B cells whose receptors (immunoglobu¬ 
lins) can bind to the foreign pathogen. 

It is estimated that upon initial exposure to a foreign 
microbe, as few as 1 in every 1 million B lymphocytes happens 
to produce cell-surface receptors capable of binding to the 
microbe. This number is too small to fight an infection effec¬ 
tively. Furthermore, the receptor’s binding affinity is likely to 
be relatively poor. However, once this relatively small popula¬ 
tion of B lymphocytes is stimulated, they begin an adaptive 
process in which additional DNA sequence variation is gener¬ 
ated via the process of somatic hypermutation (see later 
discussion). These DNA mutations, which are confined to the 
genes that encode the cell-surface receptors, in turn produce 
alterations in the receptors’ binding characteristics (e.g., the 
shape of the protein). Some of these variant receptors possess 
a higher level of binding affinity for the microorganism. The 
B cells that produce these receptors are favorably selected 
because they bind the pathogen for a longer period of time. 
They thus proliferate rapidly. These B cells then become 
plasma cells, which secrete their cell-surface receptors, or 
immunoglobulins, into the blood stream. The secreted mol¬ 
ecules, which are structurally identical to the receptors on the 
B cell’s surface, are antibodies. It can now be seen how the 
adaptive immune system gets its name: it involves the initial 
selection of B cells and T cells whose receptors can bind with 
the pathogen, followed by subsequent fine-tuning (adapta¬ 
tion) of these cells to achieve higher binding affinity. 

During the B-cell response to a foreign peptide, 
the binding affinity of immunoglobulins for 
the invading pathogen increases. When mature, 
the B cell becomes an antibody-secreting 
plasma cell. 


After initial stimulation by the disease pathogen, the 
process of B-cell differentiation and maturation into 
antibody-producing plasma cells requires about 5 to 7 days 
for completion. Each plasma cell is capable of secreting 
approximately 10 million antibody molecules per hour. Anti¬ 
bodies bind to the pathogen’s surface antigens (a term derived 
from “anhhody generating”) and may neutralize the micro¬ 
organism directly. More often, the antibody tags the pathogen 
for destruction by other components of the immune system, 
such as complement proteins and phagocytes. 

Another important activity of the humoral immune 
response is the creation of memory B cells, a subset of high- 
affinity-binding B cells that persist in the body after the infec¬ 
tion has subsided. These cells, which have already been highly 
selected for response to the pathogen, provide a more rapid 
response should the pathogen be encountered again later in 
the individual’s life. Vaccinations are effective because they 
induce the formation of memory cells that can respond to a 
specific pathogen. 

The Cellular Immune System 

Some microorganisms, such as viruses, are very adept at 
quickly inserting themselves into the body’s cells. Here, they 
are inaccessible to antibodies, which are water-soluble pro¬ 
teins that cannot pass through the cell’s lipid membrane. A 
second component of the adaptive immune system, the cel¬ 
lular immune system, has evolved to combat such infections. 
A key member of the cellular immune response is the class I 
MHC molecule, which is found on the surfaces of nearly all 
of the body’s cells. In a normal cell, the class I MHC molecule 
binds with small peptides (8 to 10 amino acids long) derived 
from the interior of the cell. It migrates to the cell’s surface, 
carrying the peptide with it and displaying it outside the cell. 
Because this is one of the body’s own peptides, no immune 
response is elicited. In an infected cell, however, the class I 
MHC molecule can bind to small peptides that are derived 
from the infecting organism. Cell-surface presentation of 
foreign peptides by the class I MHC molecule alerts the 
immune system, T cells in particular. Recall that T lympho¬ 
cytes learn to recognize and tolerate self peptides (in conjunc¬ 
tion with MHC molecules) while developing in the thymus, 
but they are highly intolerant of foreign peptides. The MHC- 
peptide complex binds to receptors on the appropriate T cell’s 
surface, which prompts the T cell to emit a chemical that 
destroys the infected cell (Fig. 9-4). Because of their ability to 
destroy cells in this way, these T lymphocytes are termed 
cytotoxic T lymphocytes or killer T lymphocytes.’^ Each 
cytotoxic T lymphocyte can destroy one infected cell every 1 
to 5 minutes. 


’^They are also known as CD8+ T cells because of the presence of 
CDS (cluster of differentiation antigen 8) coreceptors on their sur¬ 
faces, which bind to the class I MHC molecule (see Fig. 9-4). Mature 
Th cells have CD4 coreceptors on their cell surfaces, which bind to 
class II MHC molecules (see Fig. 9-3) and are thus known as CD4+ 
T cells. 
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FIG 9-2 The humoral immune response. Class II major histocompatibility complex (MHC) mol¬ 
ecules in antigen-presenting cells carry foreign peptides to the surface of the cell, where the 
foreign peptide is recognized by a helper T cell. The T cell secretes cytokines, which stimulate 
B cells whose immunoglobulins will bind to the foreign peptide. These B cells become plasma 
cells, which secrete antibodies into the circulation to bind with the microbe, helping to combat 
the infection. (Modified from Nossal GJ. Life, death and the immune system. Sci Am. 
1993 : 269 : 53 - 62 .) 
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FIG 9-3 A detailed view of the binding between a helper T cell and a B cell. In addition to the 
binding of the T-cell receptor to the MHC-peptide complex, a number of other molecules interact 
with one another, such as the costimulatory B7-CD28 complex. MHC, major histocompatibility 
complex. (Modified from Roitt I, Brostoff J, Male D. Immunology. 6th ed. St. Louis: Mosby; 
2001 .) 


T cells are often alerted to the presence of an infection 
when circulating dendritic cells present the foreign peptides 
on their cell surfaces and migrate to secondary lymphoid 
tissues, where most of the T cells reside. As with B lympho¬ 
cytes, only a small fraction of the body’s T cells has receptors 
with binding affinity for the infecting pathogens. 

The immune system is capable of destroying 
the body's cells once they are infected. Pep¬ 
tides from the pathogen are displayed on cell 
surfaces by class I MHC molecules. These are 
recognized by cytotoxic (killer) T lymphocytes, 
which destroy the infected cell. 

In contrast to cytotoxic T cells, which destroy infected cells 
directly, helper T (Th) cells respond to the presence of patho¬ 
gens by secreting cytokines. These molecules in turn stimu¬ 
late the development of other components of the immune 
system, such as B cells and cytotoxic T cells. Th cells are clas¬ 
sified into subsets, depending on which cytokines they secrete. 
For example, ThI cells, which are involved primarily in com¬ 
batting intracellular pathogens, secrete interleukin-2 (IL-2), 
interferon-y, and tumor necrosis factor (3. Th 2 cells, which 
secrete IL-4, IL-5, IL-6, and IL-13, help to fight multicellular 
parasites and are involved in allergic responses. Other helper 


T-cell subsets include Th 17 cells, which secrete IL-17, and 
Th 22 cells, which secrete IL-22. 

As in the B-cell component of the adaptive immune 
system, a subset of long-lived T cells is retained (memory T 
cells) to quickly respond to a foreign pathogen should it be 
encountered again in the future. Yet another type of T cell, 
the regulatory T cell, helps to regulate the immune system 
so that self peptides are not inadvertently attacked. 

Helper T cells stimulate development of other 
components of the immune system, such as 
cytotoxic T cells and B cells, by secreting cyto¬ 
kines. Various subsets of helper T cells have 
been described, depending on which cytokines 
they secrete. Memory T cells remain long after 
an infection and ensure rapid response to a 
subsequent infection by the same pathogen. 
Regulatory T cells help to prevent the immune 
system from attacking the body's own cells. 

The Innate, Humoral, and Cellular Immune 
Systems: A Comparison 

Although the innate and adaptive immune systems are 
described separately, a great deal of interaction takes place 
between them, and the two systems fulfill complementary 
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FIG 9-4 In a cell infected by a virus, the viral peptides (1, 2) are carried to the cell surface by 
class I MHC molecules (3). The T-cell receptor of a CD8+ cytotoxic T cell binds to the peptide- 
MHC complex {4). Recognizing the peptide as foreign, the cytotoxic T cell secretes chemicals 
that either kill the Infected cell directly or induce it to undergo programmed cell death (apoptosis; 
see Chapter 11) (5). MHC, major histocompatibility complex. (Modified from Janeway CA Jr. 
How the immune system recognizes invaders. Sci Am. 1996;269:73-79.) 


functions. The innate system, because it recognizes general 
features of pathogens, can react very quickly to foreign ele¬ 
ments. While doing so, it signals the adaptive immune system 
to initiate a fine-tuned response to the pathogen. Without this 
signal, the adaptive immune system is incapable of respond¬ 
ing to an infection. After several days during which the adap¬ 
tive system “learns” the characteristics of the pathogen, it can 
launch a massive, specialized response. Through the creation 
of memory B and T cells, the adaptive immune system allows 
the organism to respond quickly and effectively to a pathogen 
should it be encountered again. No such memory cells exist 
for the innate immune system. 

The humoral immune system is specialized to combat 
extracellular infections, such as circulating bacteria and 
viruses. The cellular immune system combats intracellular 
infections, such as parasites and viruses within cells. However, 


this division of labor is not strict, and there is again a great 
deal of interaction between the humoral and cellular compo¬ 
nents of the immune system. 

IMMUNE RESPONSE PROTEINS: GENETIC 
BASIS OF STRUCTURE AND DIVERSITY 

Immunoglobulin Molecules and Genes 

As illustrated in Figure 9-5, each antibody (or immunoglobu¬ 
lin) molecule is composed of four chains: an identical pair of 
longer heavy chains and an identical pair of shorter light 
chains, which are linked together by disulfide bonds. There 
are five different types of heavy chains (termed y, p, a, 5, and 
e) and two types of light chains (k and X). The five types of 
heavy chains determine the major class (or isotype) to which 
an immunoglobulin (Ig) molecule belongs: y, p, a, 5, and e 
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Light chains 



Antibody (molecular model) 



Light chain 


Heavy chain 
Disulfide bond 


FIG 9-5 An antibody molecule consists of two identical light chains and two identical heavy 
chains. The light chain includes variable, joining, and constant regions; the heavy chain includes 
these regions as well as a diversity region located between its variable and joining regions. The 
upper portion of the figure depicts a molecular model of antibody structure. 


correspond to the immunoglobulin isotypes IgG, IgM, IgA, 
IgD, and IgE, respectively. Immature B lymphocytes produce 
only IgM, but as they mature, a rearrangement of heavy chain 
genes called class switching occurs. This produces the other 
four major classes of immunoglobulins, each of which differs 
in amino acid composition, charge, size, and carbohydrate 
content. Each class tends to be localized in certain parts of 
the body, and each tends to respond to a different type of 
infection. The two types of light chains can be found in asso¬ 
ciation with any of the five types of heavy chains. 

The heavy and light chains both contain a constant and a 
variable region, which are located at the carboxyl (C)-termi- 
nal and amino (N)-terminal ends of the chains, respectively. 
The arrangement of genes encoding the constant region 
determines the major class of the Ig molecule (e.g., IgA, IgE). 
The variable region is responsible for antigen recognition and 
binding and thus varies within immunoglobulin classes. 
Three distinct gene segments encode the light chains: C for 
the constant region, V for the variable region, and J for the 
region that joins the constant and variable regions. Four gene 
segments encode the heavy chains, with C, V, and J coding 
again for the constant, variable, and joining regions, respec¬ 


tively, and a “diversity” (D) region located between the joining 
and variable regions. 

Immunoglobulin molecules consist of two 
identical heavy chains and two identical light 
chains. The heavy chain constant region deter¬ 
mines the major class to which an immuno¬ 
globulin belongs. The variable regions of the 
light and heavy chains recognize and bind 
antigens. 

The Genetic Basis of Antibody Diversity 

Because the immune system cannot “know” in advance which 
types of microbes it will encounter, the system must contain 
a huge reservoir of structurally diverse immune cells so that 
at least a few cells can respond (i.e., bind) to any invading 
microbe. Indeed, the humoral immune system is capable of 
generating approximately 100 billion structurally distinct 
antibodies. At one time, it was thought that because each 
antibody has a unique amino acid sequence, each must be 
encoded by a different gene. However, this one gene-one 
antibody hypothesis could not possibly be correct, because 
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the human genome has only 20,000 to 25,000 protein-coding 
genes. Further study has shown that several mechanisms are 
responsible for generating antibody diversity in somatic cells: 

1. Multiple Germline Immunoglobulin Genes 

Molecular genetic studies (cloning and DNA sequencing) 
have shown that for each heavy and light chain, an individual 
has more than 80 different V segments located contiguously 
in his or her germline and six different J segments. There are 
at least 30 D segments in the heavy chain region. 

2. Somatic Recombination (VDJ Recombination) 

As immunoglobulin molecules are formed during B lympho¬ 
cyte maturation, a specific combination of single V and J 
segments is selected for the light chain, and another combina¬ 
tion of V, D, and J segments is selected for the heavy chain. 
This is accomplished by deleting the DNA sequences separat¬ 
ing the single V, J, and D segments before they are transcribed 
into mRNA (Fig. 9-6). The deletion process is carried out in 
part by recombinases (encoded by the RAGl and RAG2 
genes), which initiate double-strand DNA breaks at specific 
DNA sequences that flank the V and D gene segments. After 
the deletion of all but one V, D, and J segment, the nondeleted 
segments are joined by ligases. This cutting-and-pasting 
process is known as somatic recombination (in contrast to 
the germline recombination that takes place during meiosis). 
Somatic recombination produces a distinctive result: unlike 
most other cells of the body, whose DNA sequences are iden¬ 
tical to one another, mature B lymphocytes vary in terms of 
their rearranged immunoglobulin DNA sequences. Because 
there are many possible combinations of single V, J, and D 
segments, somatic recombination can generate approximately 
100,000 to 1,000,000 different types of antibody molecules. 

3. Junctional Diversity 

As the V, D, and J regions are assembled, slight variations 
occur in the position at which they are joined, and small 
numbers of nucleotides may be deleted or inserted at the 
junctions joining the regions. This creates even more varia¬ 
tion in antibody amino acid sequence. 

4. Somatic Hypermutation 

Typically, only a small subset of B cells has cell-surface recep¬ 
tors (immunoglobulins) that can bind to a specific foreign 
antigen, and their binding affinity is usually low. Once this 
subset of B cells is stimulated by a foreign antigen, they 
undergo an affinity maturation process characterized by 
somatic hypermutation of the V segments of immunoglobu¬ 
lin genes, as mentioned previously. An enzyme termed 
activation-induced deaminase causes cytosine bases to be 
replaced by uracil. Error-prone DNA polymerases are 
recruited, and DNA repair processes are modified so that 
mutations can persist in the DNA sequence. Consequently, 
the mutation rate of these gene segments is approximately 
10“^ per base pair per generation (recall that the mutation 
rate in the human genome is normally only per base pair 
per generation). This causes much additional variation in 
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FIG 9-6 A, Somatic VDJ recombination in the formation of a 
heavy chain of an antibody molecule. The functional heavy 
chain is encoded by only one segment each from the multiple 
V, D, and J segments. This produces a subset of B cells 
whose receptors have low-affinity binding for a foreign 
antigen. Once the antigen is encountered in the secondary 
lymphoid tissue, somatic hypermutation (B and C) is initiated. 
Most of the mutated receptors have little binding affinity (B), 
but eventually somatic hypermutation produces a subset of 
receptors with high-affinity binding (C). The cells that contain 
these receptors become antibody-secreting plasma cells. 


immunoglobulin-encoding DNA sequences and thus in the 
antigen-binding properties of the encoded immunoglobu¬ 
lins. Because mutation is a random process, most of the new 
receptors have poor binding affinity and are thus not selected. 
Eventually, however, somatic hypermutation produces a 
subset of immunoglobulins that have high-affinity binding 
to the foreign antigen, and the B cells that harbor these 
immunoglobulins are selected to proliferate extensively. The 
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end result is a population of mature plasma cells that secrete 
antibodies that are highly specific to the invading pathogen. 

5. Multiple Combinations of Heavy and Light Chains 

Further diversity is created by the random combination of 
different heavy and light chains in assembling the immuno¬ 
globulin molecule. 

Each of these mechanisms contributes to antibody diver¬ 
sity. Considering all of them together, it has been estimated 
that as many as 10^^ distinct antibodies can potentially be 
produced. 

Mechanisms that produce antibody diversity 
include multiple germline immunoglobulin 
gene segments, somatic recombination of the 
immunoglobulin gene segments, junctional 
diversity, somatic hypermutation, and the 
potential for multiple combinations of heavy 
and light chains. 

T-Cell Receptors 

The T-cell receptors are similar in many ways to the immu¬ 
noglobulins, or B-cell receptors. Like the immunoglobulins, 
T-cell receptors must be able to bind to a large variety of 
peptides derived from invading organisms. Unlike immuno¬ 
globulins, however, T-cell receptors are never secreted from 
the cell, and T-cell activation requires the presentation of 
foreign peptide along with an MHC molecule. Approximately 
90% of T-cell receptors are heterodimers composed of an a 
and a (3 chain, and approximately 10% are heterodimers com¬ 
posed of a Y and a 5 chain (Fig. 9-7). A given T cell has a 
population of either a-(3 receptors or y-S receptors. 

Most of the mechanisms involved in generating immuno¬ 
globulin diversity—multiple germline gene segments, VDJ 
somatic recombination, and junctional diversity—are also 
important in generating T-cell receptor diversity. Flowever, 
somatic hypermutation does not occur in the genes that 


encode the T-cell receptors. This is thought to help avoid the 
generation of T cells that would react against the body’s own 
cells (an autoimmune response, which is discussed later in 
the chapter). 

T-cell receptors are similar in function to B-cell 
receptors (immunoglobulins). Unlike immuno¬ 
globulins, however, they can bind to a foreign 
peptide only when it is presented by an MHC 
molecule. Their diversity is created by the 
same mechanisms that produce immunoglob¬ 
ulin diversity, with the exception of somatic 
hypermutation. 

THE MAJOR HISTOCOMPATIBILITY 
COMPLEX 

Class I, II, and III Genes 

The MHC, which includes a series of more than 200 genes 
that lie in a 4-Mb region on the short arm of chromosome 6 
(Fig. 9-8), is commonly classified into three groups: class I, 
class II, and class III. The class I MHC molecule forms a 
complex with foreign peptides that is recognized by receptors 
on the surfaces of cytotoxic T lymphocytes. Class I presenta¬ 
tion is thus essential for the cytotoxic T-cell response. Some 
viruses evade cytotoxic T-cell detection by down-regulating 
the expression of MHC class I genes in the cells they infect. 

Class I MHC molecules are composed of a single heavy 
glycoprotein chain and a single light chain called [32- 
microglobulin (Fig. 9-9A). The most important of the class I 
loci are labeled human leukocyte antigens A, B, and C (HLA- 
A, -B, and -C). Each of these loci has dozens or hundreds of 


Hhese molecules were named “human leukocyte antigens” because 
they were seen in early studies on leukocyte (white cell) surfaces. 
However, as mentioned previously, they are found on the surfaces 
of nearly aU cells. 




FIG 9-7 The T-cell receptor is a heterodimer 
that consists of either an a and a |3 chain or a 
yand a 8 chain. The complex of MHC molecule 
and antigen molecule is bound by the variable 
regions of the a and |3 chains. (Modified from 
Raven PH, Johnson GB. Biology. 3rd ed. St. 
Louis: Mosby: 1992.) 
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FIG 9-8 A map of the human major histocompatibility complex. The 4-Mb complex is divided 
into three regions: classes I, II, and III. 


Antigenic 



FIG 9-9 A, A class I major histocompatibility complex (MHC) molecule, showing the structure 
of the heavy chain, which consists of three extracellular domains (ai, a 2 , and as), a membrane- 
spanning domain, and a cytoplasmic domain. A groove formed by the ai and as domains carries 
peptide for presentation to T-cell receptors. The domain associates closely with the jBs- 
microglobulin (PsM) chain. B, A class II MHC molecule, showing the structure of the a and P 
chains. Each has two globular extracellular domains, a membrane-spanning domain, and a cyto¬ 
plasmic domain. The a, and Pi domains form a groove into which peptide nestles for presentation 
to T-cell receptors. (Modified from McCance KL, Huether SE. Pathophysiology: the Biologic Basis 
for Disease in Adults and Children. 7th ed. St. Louis: Elsevier; 2014:234.) 


alleles, resulting in a high degree of class I MHC variability 
among individuals. The class I region spans 1.8 Mb and 
includes a number of additional genes and pseudogenes 
(genes that are similar in DNA sequence to coding genes but 
that have been altered so that they cannot be transcribed or 
translated). 

The class I molecules were first discovered in the 1940s by 
scientists who were experimenting with tissue grafts in mice. 
When the class 1 alleles in donor and recipient mice differed, 
the grafts were rejected. This is the historical basis for the 
term major histocompatibility complex. In humans, matching 
of the donor’s and recipient’s class I alleles increases the prob¬ 
ability of graft or transplant tolerance. Because grafts and 
transplants are a relatively new phenomenon in human 
history, the MHC obviously did not evolve to effect trans¬ 
plant rejection. Instead, T cells, when confronted with foreign 
MHC molecules on donor cells, interpret these as foreign 
peptides and attack the cells. 


The class I MHC molecules are encoded by 
the highly polymorphic HLA-A, -B, and -C loci 
on chromosome 6. In addition to presenting 
foreign peptides on the surfaces of infected 
cells, they can also bring about transplant 
rejection when non-self MHC molecules stimu¬ 
late cytotoxic T cells. 

Whereas the class I MHC molecules are found on the 
surfaces of nearly all cells and can bind with cytotoxic T-cell 
receptors, the class II MHC molecules ordinarily are found 
only on the surfaces of the immune system’s APCs (e.g., 
phagocytes and B lymphocytes). When associated with foreign 
peptides, they stimulate helper T cell activity after binding to 
the T cells’ receptors, as described previously. The class II 
molecules are heterodimers consisting of an a and a (3 chain, 
each of which is encoded by a different gene located on chro¬ 
mosome 6 (see Fig. 9-9B). In addition to the genes in the 
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major class II groups (HLA-DP, -DQ, and -DR), this region 
includes genes that encode peptide transporter proteins 
(TAPI and TAP2) that help to transport peptides into the 
endoplasmic reticulum, where they initially form complexes 
with class I molecules before migrating to the cell surface. 

Class II MHC molecules are heterodimers 
encoded by genes on chromosome 6. They 
present peptides on the surfaces of antigen- 
presenting cells. These peptides, in conjunc¬ 
tion with class II MHC molecules, are bound by 
receptors on helper T cells. 

Like the major class I MHC loci, the major class II loci are 
highly polymorphic, expressing hundreds of different alleles. 
Indeed, the MHC loci are, as a class, the most polymorphic 
loci known in humans. Each MHC allele encodes a molecule 
with slightly different binding properties: some variants bind 
peptide from a given pathogen more effectively than others 
do.* Consequently, a person who expresses a greater variety 
of MHC molecules has a better chance of dealing effectively 
with diverse infectious organisms. For example, someone 
who is homozygous for each of the major class I loci (A, B, 
and C) expresses only three different class I MHC molecules 
in each cell, whereas someone who is heterozygous for each 
of these loci expresses six different class I MHC molecules in 
each cell and can cope more successfully with pathogenic 
diversity (many thousands of MHC molecules are expressed 
on a typical cell’s surface). A higher degree of polymorphism 
in the general population increases the chance that any indi¬ 
vidual in the population is heterozygous. For example, HIV- 
infected persons who are heterozygous for the HLA-A, 
HLA-B, and/or HLA-C loci have longer survival times than 
those who are homozygous at these loci. In addition, greater 
MHC polymorphism in a population decreases the chance 
that an infectious pathogen can spread easily through the 
population. Thus, the high degree of polymorphism in MHC 
genes is thought to be the result of natural selection for allelic 
variation. 

In some cases, specific MHC alleles are known to produce 
proteins that are effective against specific pathogens. For 
example, the HLA-B53 allele was shown to have a strong 
protective effect against severe malaria in the population of 
Gambia, and the HLA-DRBl* 13:02 allele protects against 
hepatitis B infection in the same population. These alleles 
produce MHC molecules that have higher-affinity binding of 
the infectious agents. 

Like the class I MHC genes, class II genes are 
highly polymorphic. This increases the ability 
of individuals and populations to respond to a 
wide variety of pathogens. 


*Because of the relatively limited repertoire of different MHC mol¬ 
ecules, the peptide-binding affinity of MHC molecules is usually 
much lower than the binding affinity of finely tuned T-cell 
receptors. 


Both class I and class II MHC molecules guide T-cell 
receptors (cytotoxic and helper, respectively) to specific cells. 
T-cell receptors recognize peptides only in combination with 
MHC molecules on cell surfaces, a phenomenon known as 
MHC restriction. Not all components of the immune system 
are MHC restricted. The complement system, for example, 
does not require direct interaction with MHC molecules. 

Some virus-infected cells and tumor cells take advantage 
of MHC restriction: they suppress the expression of MHC 
molecules on their surfaces in an attempt to evade detection 
by T cells (Clinical Commentary 9-1). Fortunately, natural 
killer cells are activated by the absence, rather than the pres¬ 
ence, of MHC molecules on cell surfaces. This activation is 
mediated by an important and diverse family of receptors 
found on the surfaces of natural killer cells, killer cell immu- 
noglobulin-like receptors (KIR). These receptors inhibit 
natural killer cells when they bind to MHC class I molecules 
on the surfaces of normal cells but activate them when MHC 
class I molecules are absent. 

The class III MHC region spans 680 kb and contains at 
least 36 genes, only some of which are involved in the immune 
response. Among the most important of these are the genes 
encoding the complement proteins. 

The genes encoding the immunoglobulins, the T-cell 
receptors, KIR, and the class I and class II MHC proteins all 
share similar DNA sequences and structural features. Thus, 
they are members of a gene family, like the globin genes, the 
color vision genes, and the collagen genes described in earlier 
chapters. Table 9-1 provides a summary of the major genes 
of the immune system and their chromosome locations. 

It is important to emphasize that the class I and class II 
MHC molecules differ greatly among individuals, but each 
cell within an individual has the same class I and class II 
molecules (this uniformity is necessary for recognition by T 
cells). In contrast, after VDJ recombination the T-cell recep¬ 
tors and immunoglobulins differ from cell to cell within indi¬ 
viduals, allowing the body to respond to a large variety of 
different infectious agents. 

The immunoglobulin, T-cell receptor, KIR, and 
MHC genes are members of a gene family. 
Whereas immunoglobulins and T-cell recep¬ 
tors vary between cells within an individual, 
MHC molecules vary between individuals. 

MHC and Disease Associations 

A number of diseases show significant associations with spe¬ 
cific MHC alleles: persons who have the allele are much more 
likely to develop the disease than are those who lack it. Some 
examples, mentioned in earlier chapters, include the associa¬ 
tion of HLA-B27 (i.e., allele 27 of the HLA-B locus) with 
ankylosing spondylitis and of HLA-DQBl with type 1 diabe¬ 
tes. An especially strong association is seen between several 
HLA-DR and -DQ alleles and narcolepsy, a disorder charac¬ 
terized by sudden and uncontrollable episodes of sleep. As 
Table 9-2 shows, most of the HLA-disease associations 
involve the class II MHC genes. 
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CLINICAL COMMENTARY 9-1 

The Immune Response as a Molecular Arms Race 

The vast majority of the pathogens that assault the human body 
are destroyed by our immune system. Consequently, there is 
strong natural selection for pathogens that can evade immune 
surveillance and destruction. These microorganisms often have 
high mutation rates, and their numbers are huge. Thus, despite 
their biological simplicity, viruses and other pathogens have 
evolved some very clever ways of overcoming the immune 
response. Our immune systems, in turn, are constantly creating 
new ways to deal with pathogenic ingenuity. Three examples 
of this molecular "arms race" are discussed here. 

Cytomegalovirus (CMV) is a common infectious agent that 
can produce mononucleosis, hemolytic anemia, pneumonitis, 
congenital infections, and thrombocytopenia (a decrease in the 
number of platelets). Cells infected with CMV are the targets 
of destruction by cytotoxic T cells. However, some CMV strains 
(as well as other viruses and tumor cells) can evade T-cell 
detection by down-regulating the expression of class I mole¬ 
cules on the surfaces of infected cells. Without viral peptide 
presentation by class I molecules, the cytotoxic T cells are blind 
to the presence of CMV, and they do not destroy the infected 
cell. At this point, the cell would normally become the target 
of natural killer cells, which attack cells that lack MHC class I 
molecules on their surfaces. But CMV has devised a way to 
outwit natural killer cells, too. The virus encodes a cell-surface 
protein that resembles class I molecules closely enough so that 
the natural killer cells mistake the viral protein for a true class 
I molecule. The viral protein is also sufficiently different irom 
a true class I MHC molecule that it does not trigger destruction 
by the more finely tuned cytotoxic T cell. In this way, CMV can 
often avoid destruction by both T cells and natural killer cells. 

An important immunological challenge for placental mammals 
is pregnancy, in which placental cells express foreign class I 
MHC molecules derived from the father. Ordinarily, such cells 
would be quickly destroyed by the mother's cytotoxic T cells. 
To avoid this, placental cells down-regulate the expression of 
class I MHC molecules on their surfaces. As with viral down- 
regulation of class I molecules, this lack of MHC class I expres¬ 
sion leaves the placental cells liable to destruction by the 
mother's natural killer cells. In this case, the cells are saved 
from destruction by presenting HLA-G molecules on their 


surface. This relatively nonvariant MHC molecule does not 
stimulate a T-cell response, which is limited to presentation of 
HLA-A, -B, and -C molecules. It does inhibit the natural killer 
ceil, which has HLA-G receptors on its surface. The fetus, like 
CMV, has thus devised ways of avoiding destruction by both 
T cells and natural killer cells. 

A third example of the molecular arms race is given by one 
of the most feared infectious agents of modern times, the 
human immunodeficiency virus (HIV). Some strains of this virus 
gain entry into macrophages and helper T cells via a cell-surface 
receptor, CC chemokine receptor 5 (CCR5). Once inside the 
cell, HIV inserts its genetic material into the nucleus and takes 
advantage of the cell's machinery to replicate itself (HIV is a 
retrovirus, a type of virus that is discussed further in Chapter 
13). Helper T cells are a critical component of the body's 
immune system, and their destruction by HIV leads to severe 
secondary immunodeficiency. Persons who are homozygous 
for a 32-bp deletion of the CCR5 gene lack the CCR5 receptor 
and are thus remarkably resistant to HIV infection. Among 
those who are heterozygous for this deletion, progression to 
AIDS symptoms after seroconversion is slowed by 2 to 4 years. 
This deletion is especially common in northeastern European 
populations, where the gene frequency reaches 0.20. It is virtu¬ 
ally absent in Asian and African populations. 

Analysis of linkage disequilibrium in the chromosomal region 
that contains CCR5 indicates that the deletion arose in Euro¬ 
pean populations only 700 to 2000 years ago. Because HIV 
appeared in humans only a few decades ago, the high gene 
frequency in northeastern Europe cannot be due to HIV and 
must be due to another selective force or perhaps to genetic 
drift. Considering the age of the deletion, it is possible that it 
underwent positive selection because it provided resistance to 
pathogens, such as smallpox, that ravaged European popula¬ 
tions in the past. Clearly, there would be strong selection in 
favor of this mutation in populations now heavily exposed to 
HIV, and the deletion would eventually rise in frequency in 
these populations. Even now, knowledge of the consequences 
of this deletion may help accelerate the medical arms race 
against HIV, where genetic modification of CCR5 is being 
tested in HIV clinical trials (see Chapter 13). 


In some cases the association between MHC alleles and a 
disease is caused by linkage disequilibrium. For example, the 
hemochromatosis locus is linked to HLA-A, and significant 
associations occur between HLA-A3 and the most common 
hemochromatosis-causing allele (see Chapter 8). There is no 
known causal link between HLA-A3 and this disorder, 
however. More likely, the association represents a past event 
in which the primary hemochromatosis mutation arose on a 
copy of chromosome 6 that had the HLA-A3 aUele. Similarly, 
the association between HLA-DQBl and HLA-DQAl and 
narcolepsy is due to linkage disequilibrium between the HLA- 
DQ region and a nearby locus that causes narcolepsy (the 
hypocretin type 2 receptor gene). 

In other cases a causal association may exist. Some MHC- 
disease associations involve autoimmunity, in which the 
body’s immune system attacks its own normal cells. For 


example, type 1 diabetes is characterized by T-cell infiltration 
of the pancreas and subsequent T-cell destruction of the 
insulin-producing beta cells. In some cases, autoimmunity 
involves “molecular mimicry.” Here, a peptide that stimulates 
an immune response is so similar to the body’s own peptides 
that the immune system begins to attack the body’s own cells. 
This phenomenon helps to explain the onset of ankylosing 
spondylitis, another autoimmune disease. Infections of HLA- 
B27-positive persons with specific microbes, such as Klebsi¬ 
ella, can lead to a cross-reaction in which the immune system 
mistakes peptides from some of the body’s normal cells for 
microbial peptides. Another such example is given by rheu¬ 
matic fever, in which a streptococcal infection initiates cross¬ 
reactivity between streptococcus and cardiac myosin. In each 
of these scenarios, the body already has a small population of 
self-reactive T cells, but they remain inactive and quite 
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TABLE 9-1 Chromosome Location and Function of Major Immune Response Genes 

GENE SYSTEM 

CHROMOSOME 

LOCATION 

GENE PRODUCT FUNCTION 

Immunoglobulin heavy chain 

14q32 

Heavy chain, the first part of antibody molecule, which binds 

(C, V, D, and J genes) 


foreign antigens 

Immunoglobulin k light chain 

2pl3 

Light chain, the second part of antibody molecule 

(C, V, and J genes) 
Immunoglobulin X light chain 

22q11 

Light chain, the second part of antibody molecule (either k or 7. 

(C, V, and J genes) 


may be used) 

T-cell receptor a 

14q11 

One chain of the a-p T-cell receptor, which recognizes antigen 

T-cell receptor p 

7q35 

with MHC molecule 

The second chain of the a-p T-cell receptor 

T-cell receptor y 

7p15 

One chain of the y-5 T-cell receptor 

T-cell receptor 5 

14q11 

The second chain of the y-b T-cell receptor 

MHC (classes I, II, and III); 

6p21 

Cell-surface molecules that present peptides to T-cell receptors. 

includes TAPI and TAP2 


TAPI and TAP2 are transporter molecules that process foreign 

Pj-microglobulin 

15q21-22 

peptides and carry them to the endoplasmic reticulum. 

Forms second chain of the class 1 MHC molecule 

RAGl, RAG2 

11p13 

Recombinases that participate in VDJ somatic recombination 


TABLE 9-2 Examples of Major 
Histocompatibility Complex and Disease 


Associations 



DISEASE 

MHC (HLA) 
ASSOCIATED 
LOCUS (LOCI)* 

APPROXIMATE 
RELATIVE RISK' 

Type 1 diabetes 

DQBl, DQA1 

10 

Ankylosing 

B27 

90 

spondylitis 

Narcolepsy 

DR2 and DQA1, 

>100 

Celiac disease 

DQBl 

DQA1, DQBl 

10 

Rheumatoid arthritis 

DRB1, DQA1 

5 

Myasthenia gravis 

C, DR3, DR7 

2.5 

Multiple sclerosis 

DRB1 

4 

Systemic lupus 

DRBl 

6 

erythematosus 

Hemochromatosis 

A3 

20 

Malaria 

B53 

0.6059 

Graves disease 

DR3 

5 

Psoriasis 

C 

13 

Abacavir 

B57 

»1000 

(anti-HIV drug) 
hypersensitivity 


*For simplicity, specific alleles are not shown here. For example, 
the HLA-B57 allele associated with abacavir sensitivity is labeled 
HLA-B*57:0h and the allele associated with psoriasis is 
HLA-C*06:02. 

■^Relative risk can be interpreted loosely as the odds that a person 
who has a risk factor (in this case, an MFIC antigen) will develop 
the disease, compared with a person who lacks the risk factor. 
Thus, a relative risk of 4 for DRB1 and multiple sclerosis means 
that persons with a specific DRB1 allele are four times more likely 
to develop multiple sclerosis than are those without it. A relative 
risk <1 (as seen for malaria and B53) indicates that the factor is 
protective against the disease. These relative risks can vary among 
different human population groups. 


harmless until they are stimulated to proliferate by a foreign 
peptide that closely resembles a self peptide. 

Autoimmunity can also be caused by specific defects in the 
regulation of immune system components. For example, 
regulatory T cells help to prevent the formation of self¬ 
reactive immune cells and require a transcription factor, 
encoded by FOXP3, for their normal development. Mutations 
in FOXP3 result in a deficiency of regulatory T cells and 
an autoimmune disease called IPEX (immunodysregulation, 
polyendocrinopathy, enteropathy, X linked). 

Other common diseases that involve autoimmunity 
include rheumatoid arthritis, systemic lupus erythematosus, 
psoriasis, and multiple sclerosis. It is estimated that approxi¬ 
mately 5% of the population suffers from some type of auto¬ 
immune disease. 

A significant number of diseases are associ¬ 
ated with specific MHC alleles. Some of these 
associations are the result of linkage disequi¬ 
librium, but most are likely to result from 
causal associations involving autoimmunity. 

The ABO and Rh Blood Groups 

Another component of the immune system involves red 
blood cell surface molecules that can cause an immune 
reaction during blood transfusions. The ABO and Rh red¬ 
cell antigen systems were discussed in Chapter 3 as early 
examples of polymorphic marker loci. They are also the 
most important systems that determine blood transfusion 
compatibility. 

The ABO System 

There are four major ABO blood types: A, B, AB, and O. The 
first three groups respectively represent persons who carry 
the A, B, or A and B antigens on their erythrocyte surfaces. 
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Those with type O have neither the A nor the B antigen. 
Persons who have one of these antigens on their erythrocyte 
surfaces possess antibodies against all other ABO antigens in 
their blood stream. (These antibodies are formed early in life 
as a result of exposure to antigens that are identical to the A 
and B antigens but are present in various microorganisms.) 
Thus, if a type B person received type A or AB blood, his or 
her anti-A antibodies would produce a severe and possibly 
fatal reaction. Type O persons, who have neither the A nor 
the B antigen and thus both anti-A and anti-B antibodies, 
would react strongly to blood of the other three types (A, B, 
and AB). It was once thought that type O persons, because 
they lack both types of antigens, could be “universal donors” 
(anyone could accept their blood). Similarly, type AB persons 
were termed “universal recipients” because they lacked both 
anti-A and anti-B antibodies. However, when patients are 
given transfusions of whole blood containing large volumes 
of serum, the donor’s antibodies can react against the recipi¬ 
ent’s erythrocyte antigens. Hence, complete ABO matching is 
nearly always done for blood transfusions. 

The ABO locus encodes red cell antigens that 
can cause a transfusion reaction if donors and 
recipients are not properly matched. 

The Rh System 

The Rh blood group is encoded by two tightly linked loci, one 
of which is labeled D. The other locus produces Rh antigens 
labeled C and E through alternative splicing of the messenger 
RNA. The D locus is of primary interest because it is respon¬ 
sible for Rh maternal-fetal incompatibility and the resulting 
disease, hemolytic disease of the newborn (HDN). Persons 
with the DD or Dd genotype have the Rh antigen on their 
erythrocytes and are Rh-positive. The recessive homozygotes, 
with genotype dd, are Rh-negative and do not have the Rh 
antigen. About 85% of persons of European ancestry are 
Rh-positive and about 15% are Rh-negative. 

Unlike the ABO system, in which antibodies normally are 
formed in response to antigens presented by other organisms, 
anti-Rh antibody production requires a stimulus by the 
human Rh antigen itself An Rh-negative person does not 
begin to produce anti-Rh antibodies unless he or she is 
exposed to the Rh antigen, usually through a blood transfu¬ 
sion or during pregnancy. Maternal-fetal incompatibility 
results when an Rh-positive man and an Rh-negative woman 
produce children. If the man’s genotype is DD, all of their 
offspring will be Rh-positive and will have Rh antigens on 
their erythrocytes. If the man is a heterozygote, with genotype 
Dd, half of their children will be Rh-positive, on average. 

There are usually no difficulties with the first Rh- 
incompatible child, because very few of the fetus’s red blood 
cells cross the placental barrier during gestation. When the 
placenta detaches at birth, a large number of fetal red blood 
cells typically enter the mother’s blood stream. These cells, 
carrying the Rh antigens, stimulate production of anti-Rh 
antibodies by the mother. These antibodies persist in the 
blood stream for a long time, and if the next offspring is again 
Rh-positive, the mother’s anti-Rh antibodies enter the fetus’s 


bloodstream and destroy its red blood cells. As this destruc¬ 
tion proceeds, the fetus becomes anemic and begins to release 
many erythroblasts (immature nucleated red cells) into its 
blood stream. This phenomenon is responsible for the 
descriptive term erythroblastosis fetalis. The anemia can lead 
to a spontaneous abortion or stillbirth. Because the maternal 
antibodies remain in the newborn’s circulatory system, 
destruction of red cells can continue in the neonate. This 
causes a buildup of bilirubin and a jaundiced appearance 
shortly after birth. Without replacement transfusions, in 
which the child receives Rh-negative red cells, the bilirubin is 
deposited in the brain, producing cerebral damage and 
usually death. Infants who do not die can develop intellectual 
disability, cerebral palsy, and/or high-frequency deafness. 

Among persons of European descent, approximately 13% 
of all matings are Rh-incompatible. fortunately, a simple 
therapy now exists to avoid Rh sensitization of the mother. 
During and after pregnancy, an Rh-negative mother is given 
injections of Rh immune globulin, which consists of anti-Rh 
antibodies. These antibodies destroy the fetal erythrocytes in 
the mother’s blood stream before they stimulate production 
of maternal anti-Rh antibodies. Because the injected antibod¬ 
ies do not remain in the mother’s blood stream for long, they 
do not affect subsequent offspring. To avoid sensitization, 
these injections must be administered with each pregnancy. 
The Rh-negative mother must also be careful not to receive 
a transfusion containing Rh-positive blood, because this 
would also stimulate production of anti-Rh antibodies. 

Maternal-fetal Rh incompatibility (Rh-negative 
mother and Rh-positive fetus) can produce 
hemolytic disease of the newborn if the moth¬ 
er's Rh antibodies attack the fetus. Administra¬ 
tion of Rh immune globulin to the mother 
prevents this reaction. 

A rarer form of maternal-fetal incompatibility can result 
when a mother with type O blood carries a fetus with type 
A or B blood. The hemolytic disease produced by this com¬ 
bination is usually so mild that it does not require treat¬ 
ment. Interestingly, if the mother is also Rh-negative and the 
child is Rh-positive, the ABO incompatibility protects against 
the more severe Rh incompatibility. This is because any fetal 
red blood cells entering the mother’s circulatory system are 
quickly destroyed by her anti-A or anti-B antibodies before 
she can form anti-Rh antibodies. 

IMMUNODEFICIENCY DISEASES 

Immunodeficiency disease results when one or more com¬ 
ponents of the immune system (e.g., T cells, B cells, MHC, 
complement proteins) are missing or fail to function normally. 
Primary immunodeficiency diseases are caused by abnor¬ 
malities in cells of the immune system and are usually pro¬ 
duced by genetic alterations. To date, more than 100 different 
primary immunodeficiency syndromes have been described, 
and it is estimated that these diseases affect at least 1 in 
10,000 persons. Secondary immunodeficiency occurs when 
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components of the immune system are altered or destroyed 
by other factors, such as radiation, infection, or drugs. For 
example, the human immunodeficiency virus (HIV), which 
causes acquired immunodeficiency syndrome (AIDS), attacks 
macrophages and helper T lymphocytes, central components 
of the immune system. The result is increased susceptibility 
to a multitude of opportunistic infections. 

B-cell immunodeficiency diseases render the patient espe¬ 
cially susceptible to recurrent bacterial infections, such as 
Streptococcus pneumoniae. An important example of a B-cell 
immunodeficiency is X-linked agammaglobulinemia (XLA). 
Patients with this disorder, the overwhelming majority of 
whom are male, lack B cells completely and have no IgA, IgE, 
IgM, or IgD in their serum. Because IgG crosses the placenta 
during pregnancy, infants with XLA have some degree of 
humoral immunity for the first several months of life. 
However, the IgG supply is soon depleted, and the infants 
develop recurrent bacterial infections. They are treated with 
large amounts of y-globulin. XLA is caused by mutations in 
a gene (BTK) that encodes a B-cell tyrosine kinase necessary 
for normal B-ceU maturation. Mutations in the genes that 
encode the immunoglobulin heavy and light chains can cause 
a form of autosomal recessive B-cell immunodeficiency. 

T-cell immunodeficiency diseases directly affect T cells, 
but they also affect the humoral immune response, because 
B-cell proliferation largely depends on helper T cells. Thus, 
in the most severe T-cell defects, affected patients develop 
severe combined immune deficiency (SCID) and are suscep¬ 
tible to many opportunistic infections, including Pneumocys¬ 
tis proved (a protozoan that commonly infects AIDS patients). 
Without bone marrow transplants, these patients usually die 
within the first several years of life. About half of SCID cases 
are caused by X-linked recessive mutations in a gene that 
encodes the y chain found in six different cytokine receptors 
(those of interleukins 2, 4, 7, 9, 15, and 21). Lacking these 
receptors, T cells and natural killer cells cannot receive the 
signals they need for normal maturation. These receptors all 
interact with an intracellular signaling molecule called Jak3. 
As might be expected, persons who lack Jak3 as a result of 
autosomal recessive mutations in the JAK3 gene experience a 
form of SCID that is very similar to the X-linked form just 
described. 

About 15% of SCID cases are caused by adenosine deami¬ 
nase (ADA) deficiency, an autosomal recessive disorder of 
purine metabolism that results in a buildup of metabolites 
that are toxic to B and T cells. This type of SCID, as well as 
the X-linked form, can be treated by bone marrow transplan¬ 
tation, and some cases are being treated successfully with 
gene therapy (see Chapter 13). 

SCID can also arise from mutations in RAGl or RAG2, two 
of the genes involved in VDJ recombination and the proper 
formation of T-cell and B-cell receptors. These mutations 
produce a combined B-cell and T-cell immunodeficiency, 
although normal natural killer cells are produced. Other 
examples of SCID are given in Table 9-3. 

Several immune system defects result in lymphocytes that 
lack MHC molecules on their surfaces. These are collectively 


termed bare lymphocyte syndrome, one form of which is 
caused by mutations in the TAP2 gene. TAP2 encodes a 
protein that helps to transport peptides to the endoplasmic 
reticulum, where they are bound by class 1 MHC molecules. 
A defect in the TAP2 protein destabilizes the class 1 MHC 
molecules so that they are not expressed on the cell surface. 
Because exposure to MHC molecules is necessary for normal 
T-cell development in the thymus, bare lymphocyte syndrome 
results in a severe reduction in the number of functional T 
and B cells. Bare lymphocyte syndrome can also be caused by 
defects in several different transcription factors that bind to 
promoters in the class 11 MHC region. The result is a lack of 
class 11 MHC molecules on APCs, a deficiency of helper T 
cells, and a consequent lack of antibody production. 

Chronic granulomatous disease (CGD) is a primary 
immunodeficiency disorder in which phagocytes can ingest 
bacteria and fungi but are then unable to kill them. This 
brings about a persistent cellular immune response to the 
ingested microbes, resulting in the formation of the granulo¬ 
mas (nodular inflammatory lesions containing macrophages) 
for which the disease is named. These patients develop pneu¬ 
monia, lymph node infections, and abscesses in the skin, liver, 
and other sites. The most common cause of CGD is an 
X-linked mutation, but there are also at least three autosomal 
recessive genes that can cause CGD. The gene that causes 
X-linked CGD was the first disease-causing gene to be iso¬ 
lated through positional cloning. It encodes a subunit of cyto¬ 
chrome b, a protein that the phagocyte requires for a burst of 
microbe-killing oxygen metabolism. 

Multiple defects in the various proteins that make up the 
complement system have been identified. Most of these are 
inherited as autosomal recessive disorders, and most result in 
increased susceptibility to bacterial infections. 

Finally, a number of syndromes include immunodeficiency 
as one of their features. One example is the DiGeorge sequence 
(see Chapter 6), in which a lack of thymic development leads 
to T-cell deficiency. Wiskott-Aldrich syndrome is an X-linked 
recessive disorder that involves deficiencies of platelets and 
progressive T-cell dysfunction. It is caused by loss-of-function 
mutations in a gene (WAS) whose protein product is expressed 
in hematopoietic cells, where it relays signals from the cell 
surface to the cytoskeleton. Wiskott-Aldrich syndrome, like 
SCID, has been successfully treated with gene therapy (see 
Chapter 13). Immunodeficiency is also seen in several syn¬ 
dromes that involve DNA instability (e.g., ataxia telangiecta¬ 
sia, Bloom syndrome, Fanconi anemia; see Chapter 3). 

Primary immunodeficiency diseases involve 
intrinsic defects of immune response cells 
(e.g., B cells, T cells, MHC, complement system, 
or phagocytes) and are usually caused by 
genetic alterations. Secondary immunodefi¬ 
ciency disorders, of which AIDS is an example, 
are caused by external factors. Immunodefi¬ 
ciency is also seen in a number of genetic 
syndromes, including several DNA instability 
disorders. 
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TABLE 9-3 Examples of Primary Immunodeficiency Diseases 


MODE OF 


CONDITION 

INHERITANCE 

BRIEF DESCRIPTION 

X-linked agammaglobulinemia 

XR 

Absence of B cells leads to recurrent bacterial infections 

SCID (y chain cytokine receptor defect or 

ADA deficiency) 

XR, AR 

T-cell deficiency leading also to impairment of humoral immune 
response; fatal unless treated by bone marrow transplantation 
or gene therapy 

SCID due to Jak3 deficiency 

AR 

Protein kinase deficiency leading to T-cell deficiency, NK cell 
deficiency, and impaired humoral immune response. 

SCID (also Omenn syndrome) due to RAG1 
or RAG2 deficiency 

AR 

Lack of recombinase activity impairs VDJ recombination, which 
leads to B-cell and T-cell deficiency 

SCID due to interleukin-7 a chain deficiency 

AR 

T-cell deficiency leading to impaired B-cell response 

Zap70 kinase deficiency 

AR 

Lack of cytotoxic T cells; defective helper T cells; impaired 
antibody response 

Purine nucleoside phosphorylase deficiency 

AR 

Purine metabolism disorder leading to T-cell deficiency 

Bare lymphocyte syndrome (BLS) 

AR 

Deficient MHC class 1 expression (TAP2 mutation) leads to 

T-cell and B-cell deficiency in type 1 BLS; mutations in 
transcription factors for MHC class II genes lead to a relative 
lack of helper T cells in type 2 BLS 

Complement system defects 

Mostly AR 

Increased susceptibility to bacterial and other infections 

DiGeorge anomaly 

AD, sporadic 

Congenital malformations include abnormal facial features, 
congenital heart disease, and thymus abnormality leading to 
T-cell deficiency 

Ataxia telangiectasia 

AR 

DMA repair defect characterized by unsteady gait (ataxia), 
telangiectasia (dilated capillaries), and thymus abnormality 
producing T-cell deficiency 

Wiskott-Aldrich syndrome 

XR 

Abnormal, small platelets, eczema, and abnormal T cells 
causing susceptibility to opportunistic infections 

Chediak-Higashi syndrome 

AR 

Partial albinism, defective lysosomal assembly, giant 
cytoplasmic granules, abnormal natural killer cells, and 
neutrophils leading to recurrent bacterial infections 

Leukocyte adhesion deficiency 

AR 

Mutations in integrin receptor genes produce phagocytes that 
cannot recognize and ingest microorganisms, which results in 
severe bacterial infections 

Chronic granulomatous disease 

XR, AR 

Phagocytes ingest microbes but cannot kill them; leads to 
formation of granulomas and recurrent infections 

Hyper IgE syndrome 

IRAK-4 deficiency 

AD, AR 

Recurrent staphylococcal infections, markedly elevated serum 

IgE levels, progressively coarse facial features 

Toll-like receptor defect caused by deficiency of interleukin-1 
receptor associated kinase-4 (IRAK-4), resulting in extracellular 
bacterial (especially Streptococcus pneumoniae) infections. 

AD, Autosomal dominant; AR, autosomal recessive, 

SC/D, severe combined immune deficiency: XR, X-linked recessive. 

■ STUDY QUESTIONS 




1. Compare the functions of class I and class II MHC 
molecules. 

2. MHC molecules and immunoglobulins both display a 
great deal of diversity but in different ways. How and why 
do these types of diversity differ? 

3. In what ways are T-cell receptors and immunoglobulins 
similar? In what ways are they different? 

4. If there are 80 V segments, 6 J segments, and 30 D seg¬ 
ments that can encode an immunoglobulin heavy chain of 
one particular class, how many different immunoglobu¬ 
lins can be formed on the basis of somatic recombination 
alone? 


5. When matching donors and recipients for organ trans¬ 
plants, siblings are often desirable donors because they are 
more likely to be HLA-compatible with the recipient. If we 
assume no crossing over within the HLA loci and assume 
four distinct HLA haplotypes among the parents, what is 
the probability that a transplant recipient will be HLA- 
identical with a sibling donor? 

6. What types of matings will produce Rh maternal-fetal 
incompatibility? 
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Genetic Basis of Development 


Developmental genetics is the study of how the instructions 
encoded in genes control and coordinate development of an 
organism, beginning with fertilization and ending with death. 
The field is important to health-care providers because muta¬ 
tions in genes can perturb developmental processes, resulting 
in increased risk for birth defects, intellectual disability, and 
cancer. Worldwide, about 6% of all live-born children have a 
major birth defect (i.e., one that has a substantial impact on 
health) so =8 million children are born each year with a major 
birth defect. Birth defects are the leading cause of infant”^ 
death in the United States and are associated with substantial 
morbidity. 

Birth defects can be isolated abnormalities, or they can be 
features of one of several thousand known genetic syndromes. 
The etiology of most birth defects is unknown; however, a 
substantial fraction is caused by mutations in genes, either 
alone or in combination, that control normal development. 
The characterization of genes and developmental processes 
that coordinate animal development has revolutionized our 
understanding of the molecular basis of human birth defects. 
This chapter provides a brief review of the genes and proteins 
that control development and then discusses some of the 
cardinal developmental processes that, when disturbed, cause 
birth defects. 

DEVELOPMENT _ 

Basic Concepts 

Animal development can be defined as the process by which 
a fertilized ovum becomes a mature organism capable of 
reproduction. A single fertilized egg divides and grows to 
form different cell types, tissues, and organs, all of which are 
arranged in a species-specific body plan (i.e., the arrange¬ 
ment and pattern of body parts). Many of the instructions 
necessary for normal development are encoded by an ani¬ 
mal’s genes. Because the genes in each cell of an organism are 
identical, several questions arise: How do cells with identical 
genetic constitutions form a complex adult organism com¬ 
posed of many different cells and tissues? What controls the 
fate of each cell, instructing a cell to become, for example, a 
brain cell or a liver cell? How do cells organize into discrete 
tissues? How is the body plan of an organism determined? 
Answering such fundamental questions has been a major 


*An infant is a person who is younger than 1 year. 


focus of developmental biology for more than a century. The 
pace of discovery has accelerated dramatically in the past 
several decades, and these discoveries are helping us to under¬ 
stand the causes of human malformations and genetic 
syndromes. 

For ethical and technical reasons, it is difficult to study 
early developmental events in human embryos. Consequently, 
a variety of nonhuman model organisms are used to facilitate 
the study of development (Table 10-1). This approach is fea¬ 
sible because the major elements (genes and pathways) that 
control animal development are conserved across a wide 
range of species and body plans. In addition, many regulatory 
switches and signaling pathways are used repeatedly during 
development to control various patterning and differentia¬ 
tion events. This underscores the point that the evolution of 
species proceeds, in part, by continual tinkering with similar 
developmental programs to effect changes in an organism’s 
phenotype. 

For example, ectopic expression (i.e., expression of the 
gene product in an abnormal location) of the Drosophila gene 
eyeless results in the formation of a well-formed but mis¬ 
placed eye (Fig. 10-1). Mice have a homologous gene, Pax6,' 
in which mutations can produce abnormally small eyes. 
When inserted ectopically into Drosophila, Pax6 again pro¬ 
duces a misplaced fly eye. Mutations in the human homolog, 
PAX6, cause defects of the eye such as cataracts and aniridia 
(absence of the iris). PAX6, Pax6, and eyeless are homologous 
genes that encode DNA transcription factors (see Chapter 3). 
Although the ancestors of Drosophila and mouse diverged 
from the lineage leading to humans 500 million and 60 
million years ago, respectively, the genes and pathways 
involved in development of the eye have been conserved. 

Approximately 2% to 3% of babies are born 
with a recognizable birth defect. Many birth 
defects are caused by mutations in genes 
encoding elements in pathways that control 
development. These pathways are highly con¬ 
served among animal species. Thus, studies of 
nonhuman animal models are invaluable for 
understanding human development and the 
causes of birth defects. 


Tt is conventional to capitalize all letters of the names of human 
genes but only the first letter of the names of mouse genes, except 
for recessive mutations, which begin with a lower-case letter. 
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TABLE 10-1 Animal Models of Human Development 


ORGANISM 

GENERATION 

TIME* 

ADVANTAGES 

DISADVANTAGES 

Caenorhabditis elegans 
(roundworm) 

9 days 

Fate of every cell known 

Genome well characterized 

Easy to breed and maintain 

Alternative body plan compared to 
vertebrates 

Tissues cannot be cultured 

Drosophila melanogaster 
(fruit fly) 

10 days 

Easy to breed 

Large populations 

Vast database of mutants 

Feasible and affordable to do large 

screens 

Alternative body plan compared to 
vertebrates 

Must be stored live; cannot be frozen 
Pathology often different compared 
to humans 

Danio redo (zebrafish) 

3 months 

Transparent embryo 

Easy to maintain 

Large populations 

Feasible and affordable to do large 

screens 

Targeted gene modification difficult 

Xenopus laevis (frog) 

12 months 

Transparent embryo is large and easy to 
manipulate 

Tetraploid genome makes genetic 
experiments difficult 

Callus gallus (chicken) 

5 months 

Easy to observe and manipulate embryo 

Genetic experiments difficult 

Mus musculus (mouse) 

2 months 

Pathology similar to humans 

Excellent tools for phenotypic 
characterization 

Targeted gene modification 
straightforward 

Fully annotated genome available 

Relatively expensive to maintain 
Manipulation of embryo is 
challenging 

Papio hamadryas (baboon) 

60 months 

Pathology and physiology similar to that 
of humans 

Very expensive to maintain 

Small populations 

Long generation time 

Strong ethical concerns with use of 
primates 


*Generation time is defined as the age at which the organism is first capable of reproduction. 


Aniridia Small eye Eyeless 



FIG 10-1 Evolutionary relationships among homologs of PAX6 and associated phenotypes illus¬ 
trating functional conservation of Pax genes among humans, mice, and Drosophila. A, Mutations 
in human PAX6 cause aplasia of the irides, or aniridia. B, Loss of function of murine Pax6 causes 
small eyes (left) compared to a wild-type mouse (right). C, Misexpression of eyeless in tissue 
fated to become an antenna causes formation of a normal but misplaced eye (arrow) instead. 
(A from Jones KL. Smith's Recognizable Patterns of Human Malformation. 6th ed. Philadelphia: 
Mosby; 2006:53. B courtesy James Lauderdale, University of Georgia. C from Science. 1995 
Mar 24; 267[5205]. Reprinted with permission From AAAS.) 


A Brief Overview of Major Processes in 
Embryonic Development 

Several major processes are involved in the development of 
the embryo. These include axis specification, pattern forma¬ 
tion, and organogenesis. As the name suggests, pattern 


formation describes a series of steps in which differentiated 
cells are arranged spatially to form tissues and organs. The 
interactions of these cells are mediated by processes such as 
induction, which occurs when the cells of one embryonic 
region influence the organization and differentiation of cells 
in a second region. Axis specification involves the definition 
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of the major axes of the body: ventral/dorsal, anterior/ 
posterior, medial/lateral, and left/right. Specification of 
polarity (direction) is an important part of this process. As 
the axes are specified, the formation of organs and limbs 
(organogenesis) begins. Each of these major processes 
involves many different proteins that form structures and 
provide signals to coordinate the development of the embryo. 
The major types of these proteins, and the genes that encode 
them, are described next. 

Embryonic development involves the pro¬ 
cesses of pattern formation, axis specification, 
and organogenesis. Each of these processes is 
controlled by a series of proteins that provide 
signals and form structures necessary for 
normal development of the embryo. 

GENETIC MEDIATORS OF DEVELOPMENT: 

THE MOLECULAR TOOLBOX 

The genes required for normal development encode many 
different products, including signaling molecules and their 
receptors, DNA transcription factors, components of the 
extracellular matrix, enzymes, transport systems, and other 
proteins. Each of these genetic mediators is expressed in com¬ 
binations of spatially and temporally overlapping patterns 
that control different developmental processes. As detailed in 
this chapter, mutations in the genes mediating development 
are a common cause of human birth defects. 

Paracrine Signaling Molecules 

Interactions between neighboring cells are usually mediated 
by proteins that can diffuse across small distances to induce 
a response. These molecules are often called paracrine factors 
because they are secreted into the space surrounding a cell 
(unlike hormones, which are secreted into the blood stream). 
Closely related paracrine factors have been isolated from a 
variety of organisms, making it clear that homologous mol¬ 
ecules are used throughout the animal kingdom. To date, 
four major families of paracrine signaling molecules have 
been described: the Fibroblast Growth Factor (FGF) family, 
the Hedgehog family, the Wingless (Wnt) family, and the 
Transforming Growth Factor (3 (TGF-|3) family. Each of these 
signaling molecules binds to one or more receptors to effect 
a response, and mutations in genes encoding these molecules 
may lead to abnormal communication between cells. Clinical 
Commentary 10-1 discusses the FGF family and associated 
receptors; the other three families are described in this 
section. 

The first member of the Hedgehog family was originally 
isolated in a Drosophila mutant that has bristles in an area 
that is naked in the normal fly (hence the name “hedgehog”). 
Vertebrates have several homologs of hedgehog, the most 
widely used of which is termed Sonic hedgehog (Shh). Among 
its many roles, Shh participates in axis specification, induc¬ 
tion of motor neurons within the neural plate, and patterning 
of the limbs. The primary receptor of Shh is a transmembrane 


protein encoded by a gene called patched. The normal action 
of patched is to inhibit the function of another transmem¬ 
brane protein called smoothened, encoded by the gene Smo. 
Binding of Shh to the patched receptor results in disinhibition 
of smoothened and activation of an intracellular signaling 
cascade that targets members of the GFI family of transcrip¬ 
tion factors (Fig. 10-2). 

Mutations in the human homolog, PATCHED (PTC), 
cause Gorlin syndrome (see Fig. 10-2), a disorder character¬ 
ized by rib anomalies, cysts of the jaw, and basal cell carcino¬ 
mas (a form of skin cancer). Somatic mutations in PTC have 
also been found in sporadic basal cell carcinomas. Thus, 
germline mutations in PTC alter the regulation of developing 
cells to cause birth defects, and somatic mutations in PTC can 
alter the regulation of terminally differentiated cells to cause 
cancer. 

The Wnt family of genes is named after the Drosophila 
gene wingless and one of its vertebrate homologs, integrated. 
The wingless gene establishes polarity during Drosophila limb 
formation, and members of the Wnt family play similar roles 
in vertebrates. Wnt genes encode secreted glycoproteins that 
bind to members of the frizzled and low-density lipoprotein 
(FDF) receptor-related protein families. In humans, 19 dif¬ 
ferent Wnt genes have been identified, and they participate 
in three signaling pathways: the canonical Wnt pathway, the 
noncanonical planar cell polarity pathway, and the nonca- 
nonical Wnt/calcium pathway. These pathways regulate a 
wide variety of developmental processes, including specifica¬ 
tion of the dorsal/ventral axis and formation of the brain, 
muscle, gonads, and kidneys. Homozygosity for mutations in 
WNT3 causes tetra-amelia (absence of all four limbs) in 
humans, and abnormal Wnt signaling has been associated 
with the formation of tumors. 

The TGF-(3 supergene family’ is composed of a large 
group of structurally related genes that encode proteins that 
form homodimers or heterodimers. Members of the TGF-|3 
supergene family include the TGF-|3 family itself, the bone 
morphogenetic protein (BMP) family, the activin family, 
and the Vgl family. Although the role of BMPs is not limited 
to bone development, members of the BMP family were 
originally isolated because of their ability to induce bone 
formation. 

Mutations in a member of the BMP family, cartilage- 
derived morphogenetic protein 1 (CDMPt), cause various 
skeletal abnormalities. Different mutations can produce dis¬ 
tinct phenotypes (allelic heterogeneity; see Ghapter 4). For 
example, a nonsense mutation in CDMPl causes dominantly 
inherited brachydactyly (short digits). Persons homozygous 
for a 22-bp duplication in CDMPl have brachydactyly as well 
as shortening of the long bones of the limbs in an autosomal 
recessive disorder called acromesomelic dysplasia. A homo¬ 
zygous missense mutation produces autosomal recessive 
Grebe chondrodysplasia, also characterized by severe short¬ 
ening of the long bones and digits. The mutant protein is not 


*A supergene family is a group of related gene families. 
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SHH 






FIG 10-2 The Sonic hedgehog ('S/ihj-Patched (Ptch)-G\\ pathway signaling pathway and associ¬ 
ated disorders. Shh protein to which a cholesterol moiety has been attached binds to Patched. 
Smo is normally inhibited by Patched, but when bound by Shh protein, this inhibition is released, 
and Smo is available to activate downstream targets such as Gli transcription factors. Cyclic 
adenosine monophosphate (cAMP) response element-binding protein (CREBBP) is a cofactor of 
Gli transcription factors. Disorders caused by perturbation of proteins in this pathway include: 
A, holoprosencephaly (Shh; abnormal midline brain development); B, Smith-Lemli-Opitz (choles¬ 
terol biosynthesis; see Chapter 7); C, Gorlin or nevoid basal cell carcinoma syndrome (PTCH; rib 
anomalies, cysts of the jaw, and basal cell skin cancer); D, Greig cephalopolysyndactyly syndrome 
(Gli; craniosynostosis, polydactyly); and E, Rubenstein-Taybi syndrome (CREBBP; intellectual 
disability, distinctive facial features, and broad thumbs). (Photos from Jones K. Smith's Recogniz¬ 
able Patterns of Human Malformation. 6th ed. Philadelphia: Saunders; 2006:116; diagram modi¬ 
fied from Turnpenny P, Ellard S. Emery's Elements of Medical Genetics. 13th ed. London: 
Churchill Livingstone; 2007.) 


CLINICAL COMMENTARY 10-1 
Disorders of Fibroblast Growth Factor Receptors 

The four fibroblast growth factor receptors (FGFRs) are highly 
homologous glycoproteins, with a common structure that con¬ 
sists of a signal peptide (an amino acid sequence that helps 
to direct the protein to its proper cellular location), three 
immunoglobulin-like (Ig-like) domains, a membrane-spanning 
segment, and an intracellular tyrosine kinase domain. FGFRs 
are receptors for 22 fibroblast growth factors (FGFs) that par¬ 
ticipate in a wide variety of biological processes that include 
cell migration, growth, and differentiation (e.g., limb formation, 
brain patterning, etc). With varying affinities, FGFs bind FGFRs, 
which leads to phosphorylation, and hence activation, of the 
tyrosine kinase domain. 

FGFRs are widely expressed in developing bone, and many 
common human autosomal dominant disorders of generalized 


bone growth (i.e., skeletal dysplasias) are caused by mutations 
in FGFR genes. The most prevalent of these disorders, affect¬ 
ing more than 250,000 people worldwide, is achondroplasia 
(ACPI), which is characterized by disproportionate short stature 
(i.e., the limbs are disproportionately shorter than the trunk) 
and macrocephaly (see Chapter 4). Nearly all persons with ACEI 
have a glycine —> arginine substitution in the transmembrane 
domain of FGFR3, resulting in constitutive FGFRS activation 
(Fig. 10-3). 

FGFRS is normally expressed in resting chondrocytes, where 
it restrains chondrocyte proliferation and differentiation. Over¬ 
activation causes further inhibition of chondrocyte growth, 
which produces skeletal defects. The degree of FGFRS activa¬ 
tion, which varies depending on the domain that is altered. 
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CLINICAL COMMENTARY 10-1 

Disorders of Fibroblast Growth Factor Receptors—cont'd 


TTTAA 


FGFR3 



SP 


igi 


AB 


Igii 


igiii 


TM 


H 


Kinase 


Kinase 


Hypochondroplasia (H) 


Thanatophoric dysplasia (T) 


Achondroplasia (A) 



FIG 10-3 Top, Schematic drawing of fibroblast growth factor 3 receptor (FGFR3) protein. Impor¬ 
tant functional domains of FGFR3 include a signal peptide (SP), three immunoglobulln-llke (Ig) 
domains, an acid box (AB), a transmembrane (TM) domain, and a split tyrosine kinase (Kinase) 
domain. The locations of point mutations causing achondroplasia (A; red}, hypochondroplasia (FI; 
yellow}, and thanatophoric dysplasia (T; light blue) are indicated. Bottom, Photographs of children 
with mutations in FGFR3. A, A boy with hypochondroplasia. He has mildly short limbs relative 
to his trunk. B, An infant with thanatophoric dysplasia, the most common of the lethal skeletal 
dysplasias. He has markedly shortened limbs and a very narrow thoracic cage. C, A girl with 
achondroplasia. She has short limbs relative to the length of her trunk (resulting in redundant 
skinfolds in the arms and legs), a prominent forehead, and a depressed nasal root. (Modified 
from Webster MK, Donoghue DJ. FGFR activation in skeletal disorders: Too much of a good 
thing. Trends Genet. 1997;13:178-182.) 


corresponds to the severity of long-bone shortening. Lesser 
degrees of activation result in the milder skeletal abnormalities 
observed in hypochondroplasia, whereas markedly increased 
activation causes a virtually lethal short-stature syndrome 
called thanatophoric dysplasia. 

These findings suggest that a potential therapeutic strategy 
might be to reduce or block the activity of FGFR3. However, 
keeping some residual FGFR3 function is necessary for 
normal development. In mice, FGF/?3-inactivating mutations 
cause expansion of the zones of proliferating cartilage 


and increased long bone growth, resulting in mice that are 
longer than average. A similar average increase in height 
has been observed in humans with a rare syndrome that is 
caused by a FGFR3 mutation predicted to reduce FGFR3 
function. 

Abnormal bone growth is also a feature of a group of auto¬ 
somal dominant disorders characterized by premature fusion 
(synostosis) of the cranial sutures, misshapen skulls, and 
various types of limb defects. Collectively, these disorders are 
called craniosynostosis syndromes. Mutations in FGFR1, 


Continued 
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CLINICAL COMMENTARY 10-1 

Disorders of Fibroblast Growth Factor Receptors—cont'd 

FGFR2, and FGFR3 cause at least six distinct craniosynostosis 
disorders, the best-known of which is Apert syndrome. The 
same mutation can sometimes cause two or more different 
craniosynostosis syndromes. For example, a cysteine —> 


tyrosine substitution in FGFR2 can cause either Pfeiffer or 
Crouzon syndrome. This suggests that additional factors, such 
as modifying genes, are partly responsible for creating different 
phenotypes (Fig. 10-4) (Table 10-2). 



FIG 10-4 A, Face of a child with Apert syndrome. Note that the skull is tall and narrow. The 
eyes are protuberant because the bony orbits are shallow. B, Hand of a child with Apert syn¬ 
drome showing a broad thumb and fusion of all of the digits (syndactyly). (From Jones KL. Smith's 
Recognizable Patterns of Human Malformation. 6th ed. Philadelphia: Mosby; 2006.) 


TABLE 10-2 Craniosynostosis Syndromes Caused by Mutations in Fibroblast Growth 
Factor Receptors 

GENE 

SYNDROME 

CHARACTERISTIC 

FGFRl 

Pfeiffer 

Broad first digits, hypertelorism 

FGFR2 

Apert 

Mid-face hypoplasia, fusion of digits 


Pfeiffer 

Broad first digits, hypertelorism 


Crouzon 

Mid-face hypoplasia, ocular proptosis 


Beare-Stevenson 

Mid-face hypoplasia, corrugated skin 


Jackson-Welss 

Mid-face hypoplasia, foot anomalies 

FGFR3 

Crouzon 

Mid-face hypoplasia, ocular proptosis, acanthosis nigricans* 


Muenke (nonsyndromic craniosynostosis) 

Mid-face hypoplasia, brachydactyly, hearing loss 


*Acanthosis nigricans is characterized by hyperplastic and hypertrophic skin of varying pigmentation, most often covering the axillae, 
neck, genitalia, and flexural surfaces. 


secreted, and it is thought to inactivate other BMPs by 
forming heterodimers with them and preventing their secre¬ 
tion. Thus, mutations causing Grebe chondrodysplasia 
produce a novel type of dominant negative effect by inacti¬ 
vating the products of other genes. 

To effect a response, extracellular signals must be trans¬ 
duced by a cell. One of the best known of these signal- 
transduction systems is the RTK/Ras GTPase/MAPK^ 
(RTK-MAPK) signaling pathway. The RTK-MAPK pathway 


regulates various cellular functions such as gene expression, 
division, differentiation, and death. Accordingly, the RTK- 
MAPK pathway is used widely during development. Muta¬ 
tions in genes that encode several components of the 
RTK-MAPK pathway have been found to cause human 


^GTPase, guanosine triphosphatase; MAPK, mitogen-activated 
protein kinase; RTK, receptor tyrosine kinase. 
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FIG 10-5 Mutations in genes in the RAS-MAPK signaling pathway cause: A, Noonan syndrome 
(short stature, webbing of the neck, and congenital heart disease); B, Costello syndrome (intel¬ 
lectual disability, thick lips, depressed nasal bridge, curly hair, cardiomyopathy); C, cardio-facial- 
cutaneous syndrome (intellectual disability, prominent forehead, hypertelorism, skin abnormalities); 
and D, neurofibromatosis type 1 (see Chapter 4). RAS-MAPK, RTK/Ras GTPase/MAPK signaling 
system (GTPase, guanosine triphosphatase; MAPK, mitogen-activated protein kinase; RTK, 
receptor tyrosine kinase). (Photos from Jones KL. Smith's Recognizable Patterns of Human 
Malformation. 6th ed. Philadelphia: Saunders; 2006:127; diagram modified from Turnpenny P, 
Ellard S. Emery's Elements of Medical Genetics. 13th ed. London: Churchill Livingstone; 2007.) 


malformation syndromes (Fig. 10-5). The best-known of 
these, Noonan syndrome, is characterized by short stature, 
characteristic facial features, webbing of the neck, and con¬ 
genital heart disease—most commonly stenosis of the pul¬ 
monary outflow tract. Most cases of Noonan syndrome are 
caused by gain-of-function mutations in the protein tyrosine 
phosphatase, non-receptor-type, 11 {PTPNH) gene, which 
encodes a protein that interacts with the RTK-MAPK pathway. 
The clinical characteristics of Noonan syndrome overlap with 
those of two rarer conditions, Costello syndrome and cardio- 
faciocutaneous syndrome (CFC) syndrome. These conditions 
are caused by mutations in other components of the RTK- 
MAPK pathway (see Fig. 10-5). Thus, disruption of different 
components of the same signaling or developmental pathway 
cause different malformation syndromes with overlapping 
clinical features. 

Other secreted proteins inhibit the function of BMPs. In 
humans, mutations in the gene Noggin, which encodes one 
of these inhibitors, cause fusion of the bones in various joints. 
In some affected persons, the joints initially appear to be 
normal. However, they are progressively obliterated by for¬ 
mation of excess cartilage that ultimately fuses the bones of 


the joint together (i.e., a synostosis) as the person ages. The 
primary joints affected are those of the spine, the middle ear 
bones, and the limbs, particularly the hands and feet. Affected 
persons experience progressively limited movement of these 
joints and develop hearing loss. 

Paracrine signaling molecules are secreted, 
diffuse a short distance, and bind to a receptor 
that effects a response. There are four major 
families of paracrine signaling molecules: the 
fibroblast growth factor (FGF) family, the 
Fledgehog family, the Wingless (Wnt) family, 
and the Transforming Growth Factor (3 (TGF-P) 
family. 

DNA Transcription Factors 

There are many different ways to regulate the expression of 
a gene. For example, a gene might not be transcribed, the 
rate of transcription may be altered, or the transcribed 
mRNA might not be translated into protein. As discussed in 
Chapter 2, genes encoding proteins that turn on (activate) or 
turn off (repress) other genes are called transcription factors. 
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Transcription factors commonly do not activate or repress 
only a single target. Often they regulate the transcription of 
many genes that, in turn, regulate other genes in a cascading 
effect. Consequently, mutations in transcription factor genes 
typically have pleiotropic effects. 

There are many different families of transcription factors, 
members of which commonly share specific properties 
such as a common DNA-binding domain. Members of these 
different families have pivotal roles in controlling develop¬ 
ment, and alterations can cause birth defects. Examples 
include homeobox-containing genes such as the HOX, PAX, 
EMX, and MSX families; high-mobility group (HMG)-box- 
containing genes such as the SOX family; and the T-box 
family. 

The HMG domain of SOX proteins appears to activate 
transcription indirectly by bending DNA so that other 
factors can make contact with promoter regions of genes. 
Several SOX genes act in different developmental pathways. 
The prototypic SOX gene is the SRY (sex-determining region 
of the Y chromosome) gene, which encodes the mammalian 
testis-determining factor (see Clinical Commentary 6-2 


in Chapter 6). Sox9 is expressed in the genital ridges of 
both sexes, but it is up-regulated in males and down- 
regulated in females before differentiation of the gonads. 
Sox9 also regulates chondrogenesis and the expression of 
CoUAl, a collagen gene (see Chapter 2). As might be pre¬ 
dicted from these expression and interaction patterns, muta¬ 
tions in SQX9 cause a disorder characterized by skeletal 
defects (campomelic dysplasia) and sex reversal that pro¬ 
duces XY females. Mutations in a related gene, SQXIO, result 
in a syndrome characterized by Hirschsprung disease (hypo- 
motility of the bowel caused by a reduced number of enteric 
nerve cells), pigmentary disturbances, and deafness (Clinical 
Commentary 10-2). 

There are many different families of tran¬ 
scription factors, each of which regulates the 
transcription of specific genes. The same tran¬ 
scription factor is often used in different devel¬ 
opmental pathways. Thus, disorders caused 
by mutations in genes encoding transcription 
factors are often pleiotropic. 



CLINICAL COMMENTARY 10-2 
Defects of Neural Crest Development 

During neurulation, neural crest cells migrate from the neuro¬ 
epithelium along defined routes to tissues, where they differ¬ 
entiate into various cell types. One fate of neural crest cells is 
to populate the small and large bowel (i.e., the enteric tract) 
with nerve cells to create the enteric nervous system. These 
cells partly control and coordinate the normal movements of 
the enteric tract that facilitate digestion and transport of bowel 
contents. Reduced or absent nerve cells in the enteric tract 
causes a disorder called Hirschsprung disease (HSCR). 

HSCR occurs in approximately 1 in 5000 live births, although 
its incidence varies among ethnic groups. Additionally, there is 
a sex bias, with males affected four times as often as females. 
The major characteristic of HSCR is hypomotility of the bowel, 
which leads to severe constipation. The disease often mani¬ 
fests in the newborn period, although it is also found in children 
and sometimes adults. If untreated, bowel hypomotility can 
lead to obstruction and gross distention of the bowel. Conse¬ 
quently, HSCR was formerly termed congenital megacolon. 

In approximately 70% of cases, HSCR occurs as an isolated 
trait, and affected persons have no additional problems. 
However, HSCR is also a well-recognized feature of many 
multiple birth defect syndromes, such as trisomy 21 and 
Waardenburg syndrome. For the last few decades, HSCR has 
been considered an example of a disorder that fits a multifacto¬ 
rial model of inheritance (i.e., caused by a combination of genes 
plus environmental factors; see Chapter 12). However, it has 
become clear that half of all cases of nonsyndromic familial 
HSCR and 15% to 20% of sporadic cases are caused by muta¬ 
tions in one of at least six different genes. Study of these 
genes provides us with a window through which we can 
observe the development of neural crest cells. 

Most commonly, HSCR is caused by mutations that inacti¬ 
vate the RET (rearranged during transfection) gene, which 


encodes a receptor tyrosine kinase (other mutations in RET 
have been associated with cancer; see Chapter 1 1). Dozens of 
different mutations have been found, including missense and 
nonsense mutations as well as deletions encompassing the 
RET gene. Thus, haploinsufficiency is the most likely mecha¬ 
nism by which mutations in RET cause HSCR. The penetrance 
of RET mutations is higher in males than in females, suggest¬ 
ing that sex-specific modifiers of the phenotype might exist. 

Normal signaling through RET appears to be required for 
neural crest migration into the distal portions of the bowel and 
for differentiation into nerve cells. One ligand for RET is glial 
cell line derived neurotrophic factor (GDNF). 

Mutations of another cell membrane receptor, endothelin-B 
(EDNRB), or its ligand, endothelin-3 (EDN3), also cause HSCR. 
Penetrance appears to vary by sex and genotype. In a large 
Mennonite community, persons homozygous for an EDNRB 
mutation were four times more likely to develop HSCR than 
were heterozygotes. In addition to HSCR, some persons with 
mutations in either EDNRB or EDN3 have melanocyte abnor¬ 
malities that produce hypopigmented patches of skin and sen¬ 
sorineural hearing loss (normal melanocytes are required for 
auditory development). This disorder is the Waardenburg-Shah 
syndrome. Thus, normal EDNRB and EDN3 signaling is required 
for the development of neural crest cells into enteric nerve 
cells and melanocytes. 

Mutations in the SOX10 transcription factor gene also have 
been found in persons with Waardenburg-Shah syndrome. 
Disruption of the homologous mouse gene, SoxlO, causes 
coat spotting and aganglionic megacolon. Although SOXgenes 
are involved in many different biological processes, the role of 
SOX10 in neural crest cell development remains to be deter¬ 
mined (Fig. 10-6). 
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CLINICAL COMMENTARY 10-2 
Defects of Neural Crest Development—cont'd 



Mid 



FIG 10-6 A, Fates of selected populations of neural crest cells migrating from different levels 
along the anterior/posterior axis of the developing embryo. Appropriate fate of neural crest deriva¬ 
tives depends on normal cell migration and terminal differentiation. B, Defects of neural crest 
cells can cause Hirschsprung disease or Waardenburg-Shah syndrome (see text). (Photo from 
Jones KL. Smith's Recognizable Patterns of Human Malformation. 6th ed. Philadelphia: Mosby; 
2006; diagram modified from Gilbert S. Developmental Biology. 7th ed. Sunderland, Mass: 
Sinauer; 2003:429.) 


Extracellular Matrix Proteins 

Extracellular matrix proteins (EMPs) are secreted macromol¬ 
ecules that serve as scaffolding for all tissues and organs. These 
molecules include collagens, fibrillins, proteoglycans, and 
large glycoproteins such as fibronectin, laminin, and tenascin. 
EMPs are not simply passive structural elements. By separat¬ 
ing adjacent groups of cells and forming matrices on which 
cells can migrate, they are active mediators of development. 
Examples are the proteins encoded by fibrillin-1 and elastin 
coordinate microfibril assembly in the extracellular matrix. 
Mutations in these two genes result in Marfan syndrome (see 
Chapter 4) and supravalvular aortic stenosis (see Chapter 6), 
respectively. Both of these conditions are characterized by 
abnormalities of the heart and/or large blood vessels. 

To facilitate cell migration, EMPs must transiently adhere 
to a cell’s surface. This is commonly accomplished by at least 
two families of cell surface receptors: integrins and glycosyl- 
transferases. Integrins are so named because they integrate 
the extracellular matrix and the cytoskeleton, allowing them 
to function in tandem. Attachment between cells and the 
extracellular matrix can be more permanent as well. One 
group of molecules that make such attachments are called 


laminins. Mutations in LAMC2, a gene that encodes a subunit 
of laminin, cause autosomal recessive junctional epidermoly¬ 
sis bullosa (JEB). Because of an inability of epithelial cells to 
properly anchor themselves to the basement membrane, the 
skin of persons with JEB spontaneously forms large blisters. 

EMPs are secreted macromolecules that serve 
as a dynamic scaffolding for tissues and 
organs. They are also active mediators of 
development. 

PATTERN FORMATION 

The process by which ordered spatial arrangements of dif¬ 
ferentiated cells create tissues and organs is called pattern 
formation. The general pattern of the animal body plan is 
laid down during embryogenesis. This leads to the formation 
of semiautonomous regions of the embryo, in which the 
process of pattern formation is repeated to form organs and 
appendages. Such regional specification takes place in several 
steps: definition of the cells of a region, establishment of 
signaling centers that provide positional information, and 
differentiation of cells within a region in response to 



























206 


CHAPTER 10 Genetic Basis of Development 


additional cues. For example, cells in the developing verte¬ 
brate upper limb differentiate into many cell types, including 
muscle (myocytes), cartilage (chondrocytes), and bone cells 
(osteocytes). Ftowever, these cells must also be arranged in 
a temporal-spatial pattern that creates functional muscle 
and bone. Additional information is required to determine 
whether a bone becomes an ulna or a humerus. Flow do 
particular structures develop in specific places? How do cells 
acquire information about their relative positions? Answering 
such questions is an area of intense investigation. 

For pattern formation to occur, cells and tissues commu¬ 
nicate with each other through many different signaling path¬ 
ways. It has become clear that these pathways are used 
repeatedly and are integrated to control specific cell fates (i.e., 
the eventual location and function of the cell). For example, 
the Shh protein is involved in patterning of the vertebrate 
neural tube, somites, and limbs, as well as the way the left side 
is distinguished from the right. Point mutations in the human 
Shh gene, SHH, can cause abnormal midline brain develop¬ 
ment (holoprosencephaly; see Fig. 10-2), severe intellectual 
disability, and early death. (Not aU affected persons have 
holoprosencephaly, however; some have only minor birth 
defects, such as a single upper central incisor.) Attachment of 
the SHH protein to cholesterol appears to be necessary for 
proper patterning of hedgehog signaling. This might partly 
explain how midline brain defects could be caused by some 
environmental substances that inhibit embryonic cholesterol 
biosynthesis and by genetic disorders of cholesterol metabo¬ 
lism such as Smith-Lemli-Opitz syndrome (see Fig. 10-2). 

The process by which ordered spatial arrange¬ 
ments of differentiated cells create tissues and 
organs is called pattern formation. Regional 
specification takes place in several steps: defi¬ 
nition of the cells of a region, establishment of 
signaling centers that provide positional infor¬ 
mation, and differentiation of cells within a 
region in response to additional cues. 

Gastrulation 

Gastrulation is the process of cell and tissue movements 
whereby the cells of the blastula are rearranged so that they 
have new positions and neighbors. In the human embryo, 
gastrulation occurs between days 14 and 28 of gestation. In 
this process, the embryo is transformed into a three-layer 
(trilaminar) structure composed of three germ layers: ecto¬ 
derm (outer layer), endoderm (inner layer), and mesoderm 
(middle layer) (Fig. 10-7). The formation of these layers is a 
prerequisite for the next phase of development, organogen¬ 
esis. The major structural feature of mammalian gastrulation 
is the primitive streak, which appears as a thickening of 
epiblast tissue extending along the anterior/posterior axis (see 
Fig. 10-7). In placental animals such as humans, gastrulation 
includes formation of the extraembryonic tissues. As might 
be predicted, the process of gastrulation is dominated by cell 
migration. Thus, many of the genes expressed during gastru¬ 
lation encode proteins that facilitate cell movement. 


Human gastrulation is characterized by cell 
and tissue movements that result in the forma¬ 
tion of three germ layers: ectoderm, endo¬ 
derm, and mesoderm. The major structural 
feature of mammalian gastrulation is the prim¬ 
itive streak. 

Neurulation and Ectoderm 

Once a trilaminar embryo is formed, the dorsal mesoderm 
and the overlying ectoderm interact to form the hollow 
neural tube. This event, called neurulation, is mediated by 
induction. In amphibians, induction of the neural tube and 
transformation of the flanking mesoderm to create an embryo 
with clear anterior/posterior and dorsal/ventral axes is con¬ 
trolled by a group of cells known as the Spemann-Mangold 
organizer. A number of proteins are expressed almost exclu¬ 
sively in the organizer. Chordin is a secreted protein that 
prevents dorsalized mesoderm from being ventralized. 
Another secreted protein. Noggin, induces neural tissue from 
dorsal ectoderm and dorsalizes the mesoderm. Understand¬ 
ing the major functions of the organizer and the molecules 
that mediate these functions is an area of active research. 

Neurulation is a critical event in development because it 
initiates organogenesis and divides the ectoderm into three 
different cell populations: the neural tube, which will eventu¬ 
ally form the brain and spinal cord; the epidermis of the skin; 
and the neural crest cells. In humans, neural tube closure 
begins at five separate sites, which correspond to the locations 
of common neural tube defects such as anencephaly (absence 
of the brain), occipital encephalocele, and lumbar spina 
bifida (see Chapter 12). Neural crest cells migrate from the 
neuroepithelium along defined routes to tissues, where they 
differentiate into cell types such as sensory neurons, melano¬ 
cytes, neurons of the small bowel, and smooth muscle cells 
(see Clinical Commentary 10-2). 

Induction is the process by which cells of one 
embryonic region influence the organization 
and differentiation of cells in a second embry¬ 
onic region. Neurulation initiates organogen¬ 
esis and induces the ectoderm to divide into 
the neural tube and neural crest cells. Defects 
of neural tube closure and neural crest migra¬ 
tion or differentiation cause some types of 
birth defects. 

Mesoderm and Endoderm 

The formation of a layer of mesoderm between the endoderm 
and ectoderm is one of the major events in gastrulation. 
Mesoderm can be divided into five components: the noto¬ 
chord; the dorsal, intermediate, and lateral mesoderms; and 
head mesenchyme.’ The notochord is a transient midline 
structure that induces the formation of the neural tube and 


’Mesenchyme is tissue that forms the connective tissues, blood 
vessels, and lymphatic vessels. 
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FIG 10-7 H uman gastrulation. A, Sagittal section through the midline of an embryo embedded 
in the uterine lining. B, Dorsal surface of an embryo exposed by removing part of the embryonic 
mesoderm surrounding the amniotic cavity and yolk sac. Arrows denote ingressing epiblast cells. 
On days 14 to 15, epiblast cells replace hypoblast cells to form endoderm. A day later, migrating 
epiblast cells are creating a layer of mesoderm. 


body axis. The adjacent dorsal mesoderm on either side of 
the notochord differentiates into elements that form the axial 
skeleton, skeletal muscles, and connective tissue of the skin. 
Intermediate mesoderm forms the kidneys and genitourinary 
system. Lateral plate mesoderm differentiates into the heart, 
the appendicular skeleton, the connective tissue of viscera 
and the body wall, and the connective tissue elements of the 
amnion and chorion. Finally, the muscles of the eyes and head 
arise from head mesenchyme. 

The primary function of embryonic endoderm is to form 
the linings of the digestive tract and the respiratory tree. 
Outgrowths of the intestinal tract form the pancreas, gall¬ 
bladder, and liver. A bifurcation of the respiratory tree pro¬ 
duces the left and right lungs. The endoderm also produces 
the pharyngeal pouches, which, in conjunction with cells 


derived from the neural crest, give rise to endodermal-lined 
structures such as the middle ear, thymus, parathyroids, and 
thyroid. 

A process common to endoderm-derived structures is 
budding and branching. This process appears to be con¬ 
trolled, in part, by FGFs, BMPs, and their respective receptors. 
Mutations in fibroblast growth factor receptor 3 (FGFR3), one 
of four FGFRs, cause three different skeletal dysplasias (see 
Clinical Commentary 10-1). The most severe of these, thana¬ 
tophoric dysplasia, is caused by mutations that activate 
FGFR3, resulting in shortened long bones, a poorly developed 
vertebral column, a small thoracic cage, and a relatively large 
skull. Children with thanatophoric dysplasia may also have 
pulmonary hypoplasia and brain anomalies, suggesting that 
FGFR3 plays a role in formation of the lung and brain. 
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Formation of a layer of mesoderm between 
endoderm and ectoderm is one of the major 
events of gastrulation. Mesoderm contributes 
to the formation of the skeleton, urogenital 
system, and limbs. Endoderm lines the diges¬ 
tive and respiratory tracts and forms visceral 
organs and the lungs. 

Axis Specification 

Animal body plans have evolved into a wide variety of sym¬ 
metries. Some animals, such as the sea anemones, are com¬ 
pletely symmetrical. Other animals (e.g., starfish) exhibit 
only a dorsal/ventral symmetry. Many animals, such as 
worms, add an anterior/posterior axis. All chordates (animals 
that develop a notochord) have a third axis that is perpen¬ 
dicular to the first two, the left/right axis. Specification and 
formation of these axes are critical events in development 
because they determine the orientation of the body plan. The 
proteins mediating these processes are rapidly being discov¬ 
ered. Many of these mediators have additional roles in pat¬ 
terning of the body plan and tissues. 

Formation of the Anterior/Posterior Axis 

The anterior/posterior axis of a developing mammalian 
embryo is defined by the primitive streak. At the anterior end 
of the primitive streak is a structure called the node. Expres¬ 
sion of a gene called nodal is required for initiating and main¬ 
taining the primitive streak; later in gastrulation, its expression 
is important in forming the left/right axis (Clinical Com¬ 
mentary 10-3). 

Patterning along the anterior/posterior axis is controlled 
by a cluster of genes that encode transcription factors that 
contain a DNA-hinding region, the homeodomain, of 
approximately 60 amino acids. These genes compose the 
homeotic gene complex (HOM-C) in Drosophila (Fig. 10-8), 
the organism in which they were first isolated through muta¬ 
tion identification. (A classic example of such a mutation, 
termed antennapedia, disturbs axis patterning so that anten¬ 
nae are replaced by legs.) In contrast to Drosophila, four 
copies of HOM-C (termed HoxA through HoxD) are found 
in humans and mice. Each of these four 100-kb gene clusters 
is located on a different chromosome, and there are 39 Hox 
genes divided among the clusters. Mammalian Hox genes are 
numbered from 1 to 13, although not every cluster contains 
13 genes. Equivalent genes in each complex (e.g., Hoxali, 
Hoxcl3, Hoxdl3) are called paralogs. 

Hox genes are expressed along the dorsal axis from the 
anterior boundary of the hindbrain to the tail. Within each 
cluster, 3' Hox genes are expressed earlier than 5' Hox genes 
(temporal colinearity). Also, the 3' Hox genes are expressed 
anterior to the 5' Hox genes (spatial colinearity). Thus, 
Hoxal expression occurs earlier and in a more anterior loca¬ 
tion than does the expression of Hoxal (see Fig. 10-8). These 
overlapping domains of Hox gene expression produce com¬ 
binations of codes that specify the positions of cells and 
tissues. Collectively these codes identify various regions 
along the anterior/posterior axis of the trunk and limbs. 


To study the role of different Hox genes in mammalian 
development, it has been common to produce a knockout 
mouse—a mouse that lacks a functional copy of the gene of 
interest (Box 10-1). 

The anterior/posterior axis of a developing 
mammalian embryo is defined by the primitive 
streak and patterned by combinations of Hox 
genes. Collectively, these combinations iden¬ 
tify various regions along the anterior/posterior 
axis of the body and limbs. Disruption of Hox 
genes produces defects in body, limb, and 
organ patterning. 

Formation of the Dorsal/Ventral Axis 

Dorsal/ventral patterning of the vertebrate depends on the 
interaction between dorsalizing and ventralizing signals. As 
previously mentioned, noggin and chordin encode secreted 
proteins that are capable of dorsalizing ventral mesoderm 
and restoring dorsal structures that have been ventralized. In 
contrast, Bmp-4 is expressed ventrally and induces ventral 
fates, patterning the dorsal/ventral axis. Noggin and chordin 
bind directly to Bmp-4 to prevent it from activating its recep¬ 
tor. Thus, the organizer promotes dorsalization by repressing 
a ventralizing signal encoded by Bmp-4. This mechanism, 
in which a signal promotes one process by repressing 
another competing process, is a common feature of embry¬ 
onic development. 

Dorsal/ventral patterning of the embryo is an 
active process that is coordinated by signaling 
molecules and their antagonists. 

Formation of Organs and Appendages 

The formation of organs and limbs (organogenesis) occurs 
after gastrulation. Many of the proteins used during this 
process are the same ones used earlier in embryonic develop¬ 
ment. As might be expected, a number of genes that were 
transcriptionally silent now become active. To date, most of 
the developmental genes known to cause human birth defects 
have prominent roles in this phase of development. This 
could represent an ascertainment bias, because mutations in 
genes that disrupt earlier developmental events may be lethal. 

Craniofacial Development 

Development of the craniofacial region is directly related to 
the formation of the underlying central nervous system. In 
mammalian embryos, neural crest cells from the forebrain 
and midbrain contribute to the nasal processes, palate, and 
mesenchyme of the first pharyngeal pouch. This mesenchyme 
forms the maxilla, mandible, incus, and malleus. The neural 
crest cells of the anterior hindbrain migrate and differentiate 
to become the mesenchyme of the second pharyngeal pouch 
and the stapes and facial cartilage. Cervical neural crest cells 
produce the mesenchyme of the third, fourth, and sixth pha¬ 
ryngeal arches (in humans the fifth pharyngeal arch 

Text continued on p. 213 
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FIG 10-8 A, Distribution of 8 Hox genes in a single cluster in Drosophila and 39 Hox genes 
among clusters on 4 chromosomes in human (HOX). Individual Hox genes are labeled from 1 
(3') to 13 (5') within each cluster. Hox genes that share the same number but are located in 
different clusters are called paralogs (e.g., HOXA)3 and HOXD)3 are paralogs). Paralogs often 
exhibit more sequence homology than do different Hox genes in the same cluster. Hox genes 
are expressed from 3' to 5' along the anterior/posterior axis of the embryo, and Hox genes 
located 3' are expressed earlier than Hox genes located 5'. B, Schematic diagram of combinato¬ 
rial codes of overlapping Hox gene expression domains along the anterior/posterior body axis. 
Hox codes determine the identity of each segment. Thus, if expression of Hoxb4 is eliminated 
(e.g., in a knockout), the combinatorial code in the third segment is altered from 1 -t 2 -t 3 to 
1 -H 2. This results in transformation of the third segment into another second segment. The 
transformation of one structure into another is called a homeotic transformation. (Modified 
from Verakas A, Del Campo M, McGinnis W. Developmental patterning genes and their con¬ 
served functions: From model organisms to humans. Mol Genet Metab. 2000;69:85-100, with 
permission.) 


BOX 10-1 Animal Models in the Study of Human Development 


There are significant obstacles to the study of genes that affect 
human development. Many of these genes are expressed 
embryonically, and it is difficult (or in some cases ethically 
undesirable) to analyze human embryos directly. Humans have 
relatively small family sizes and a long generation time. Human 
mating patterns are often not conducive to genetic study. For 
these and other reasons, animal models of human diseases are 
a useful alternative to the direct study of the disease in humans. 

The mouse is commonly used as an animal model of human 
disease because it is a well-understood and easily manipulated 
experimental system and because many developmental genes 


are conserved in most mammalian species. In some cases, a 
naturally occurring animal model of a human genetic disease 
exists (e.g., dog and mouse models for muscular dystrophy), 
but natural mouse models are relatively uncommon. To over¬ 
come this difficulty, human genes can be inserted directly into 
mouse embryonic stem cells, which are then placed into 
mouse embryos to create a transgenic mouse. The expression 
of the human gene can then be studied directly in mouse 
embryos. It is also possible to use targeted disruption to alter 
a specific mouse gene so that it is not expressed. This is a 
knockout model. Mice that are heterozygous for the disrupted 
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BOX 10-1 Animal Models in the Study of Human Development—cont'd 


gene can be bred to produce homozygotes that produce no 
functional protein from the gene targeted. Many human genetic 
diseases have been studied using mouse knockouts, includ¬ 
ing neurofibromatosis type 1, Gaucher disease, Huntington 
disease, myotonic dystrophy, fragile X syndrome, cystic 


fibrosis, and subtypes of Alzheimer disease. Mouse knockouts 
have been so important for studying development and human 
conditions that an international consortium was formed to sys¬ 
tematically produce knockouts of every protein-coding gene in 
the mouse genome. This initiative is well underway. 



Isolate 

blastocyst 



Reimplant into 
foster mother 



Isolate inner 
cell mass 




Cultured ES cells 


Gene targeting 
or insertion of 
transgene 




Inject into 
cavity of 
blastocyst from a Genetically 
different mouse strain modified 
ES cells 


Backcross 



Mouse containing homozygous Heterozygous Normal mouse 

genetic modification in all mouse 

nucleated cells, “knockout” 


Construction of an animal model. Blastocysts are collected from a pregnant mouse that has a 
marker that identifies its strain (e.g., light coat color). The inner cell mass is isolated, and embry¬ 
onic stem (ES) cells are cultured. ES cells can be modified to introduce foreign genes (creating 
a transgenic animal) or to disrupt the normal function of an endogenous gene (creating a knockout 
animal). Genetically modified ES cells are implanted into blastocysts from a different mouse 
strain that has a marker recessive to the marker in the modified strain (e.g., dark coat color is 
recessive to light coat color). Modified blastocysts are injected into a pseudopregnant surrogate 
mouse. Development of introduced blastocysts results in chimeric animals with two populations 
of cells (i.e., some cells have the genetic modification and other cells do not). Chimeras can be 
detected by the presence of two markers in the same mouse (e.g., two different coat colors in 
the same mouse). Backcrossing of chimeras and mating of heterozygotes can produce mice that 
are homozygous for a genetic modification (e.g., a knockout), heterozygous for a genetic modi¬ 
fication, or normal. (Modified from Strachan T, Read AP. Human Molecular Genetics. Oxford: 
Bios Scientific: 1996.) 
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BOX 10-1 Animal Models in the Study of Human Development—cont'd 


In addition to their roles in morphogenesis and organogene¬ 
sis, some genes are critical for early embryogenesis. Conse¬ 
quently, knocking out their function results in embryonic 
lethality. This makes it difficult to study the role of these genes 
using targeted disruption. One way to overcome this problem 
is to condition the disruption of a gene so that it takes place 
only in a certain type of cell (e.g., neural crest), in a specific 
tissue (e.g., the limb), or at a specific time during development. 
This is a conditional knockout. For example, constitutional 
disruption of Fgf8 is lethal in early embryogenesis. To study 
the effect of inactivating Fgf8 in the limb, a mouse can be 
engineered so that the function of Fgf8 is disrupted only in the 
apical ectodermal ridge (AER) of the forelimb bud. The result 
is a live-born mouse whose forelimbs are severely truncated 
but whose other organs and body areas are all normal. 

The methods used to produce knockout mice are relatively 
inefficient and expensive, and they cannot be used to generate 
knockouts in most other species. In addition, many human 
conditions are caused by single base substitutions or small 
deletions/insertions that do not inactivate a gene. Accordingly, 
developing accurate mouse models that enable functional char¬ 
acterization of such mutations has proved to be extremely 
challenging. Recently, several new gene targeting technologies 
have been Introduced to engineer subtle changes in the 
mouse genome. Zinc-finger nucleases (ZFNs) and transcrip¬ 
tion activator-like effector nucleases (TALENs) are classes 
of engineered nucleases that have proven effective for produc¬ 
ing targeted mutations with high efficiency not only in ES cells 
but also when Injected directly into fertilized mouse oocytes. 

More recently, an even more powerful strategy to edit the 
genome has been introduced. This system is based on bacterial 
enzymes that degrade nucleic acids of invading bacterial 
pathogens (i.e., phages and plasmids). The system consists of 
clustered regularly interspersed short palindromic repeat 
(CRISPR) genes and CRISPR-associated (i.e., Cas) genes. The 
CRISPR/Cas system can be used to generate mutations in the 
genomes of multiple different species, and multiple genes can 
be targeted simultaneously. This system offers the potential to 
substantially increase the throughput with which targeted muta¬ 
tions can be created and substantially reduces the cost of creat¬ 
ing new mutant mouse alleles. These new systems for editing 
the genomes of mice and other organisms offer exciting oppor¬ 
tunities to explore the functional consequences of the hundreds 
of thousands of sites in human protein-coding genes that vary 
among persons. 

Animal models do not always mimic their human counter¬ 
parts accurately. Sometimes this reflects differences in the 


interactions of gene products in the model system and the 
human. Such differences might account for the fact that a 
heterozygous mouse knockout of the retinoblastoma homolog 
(RB) develops pituitary tumors instead of retinoblastomas. In 
some cases the knockout has little detectable effect, possibly 
reflecting genetic redundancy: even though the expression of 
one gene product is blocked, a backup system might compen¬ 
sate for its loss. Thus, the mouse knockout of either Hoxal 1 
or Hoxd^ 1 alone has little phenotypic effect, but the simultane¬ 
ous knockout of both genes produces a severe reduction in 
length of the radius and ulna. Despite these potential short¬ 
comings, the Introduction or disruption of genes in mice and 
other model systems can be a powerful approach for analyzing 
human genetic disease. 



Conditional mutants that lack FgfQ expression in the apical 
ectodermal ridge (AER) of the forelimb. A, In situ hybridiza¬ 
tion demonstrating Fgf8 expression (dark band; arrow) 
in the AER of the developing limbs of a wild-type mouse. 
B, In the conditional mutant, no Fgf8 is expressed in the 
forelimb (no dark band; arrow), although it is expressed in 
the hindlimb bud. C, Normal forelimb in the wild-type mouse 
(arrow). D, Severely hypoplastic limb in the conditional 
mutant (arrow). (Courtesy Dr. Anne Moon, Geisinger Clinic 
and the University of Utah.) 


CLINICAL COMMENTARY 10-3 

Laterality Defects: Disorders of the Left/Right Axis 

Left/right (L/R) asymmetry is common in nature. For example, 
all animals use only L-amino acids and D-sugars. Likewise, all 
vertebrates have asymmetrical structures that are consistently 
oriented L/R of the midline. For example, looping of the heart 
tube toward the right, the first observable sign of L/R asym¬ 
metry in the embryo, is seen in all chordates. How did L/R 
asymmetry evolve? How and when is L/R asymmetry estab¬ 
lished? Substantial progress is being made toward understand¬ 


ing the molecular basis of L/R asymmetry so that answers may 
be obtained for these questions. This is important because 
disorders of L/R asymmetry (i.e., laterality defects) are found 
in approximately 1 in 10,000 live births. 

The final position of asymmetrically placed vertebrate struc¬ 
tures Is determined by at least three different mechanisms. 
Unpaired organs In the chest and abdomen (e.g., heart, liver) 
begin their development in the midline and then lateralize to 
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CLINICAL COMMENTARY 10-3 

Laterality Defects: Disorders of the Left/Right Axis—cont'd 

their adult positions. The mirror image of a paired structure can 
regress, leaving a lateralized, unpaired structure (e.g., some 
blood vessels). Some organs (e.g., lungs) begin as asymmetri¬ 
cal outgrowths from a midline structure. It is unknown whether 
the molecular basis of laterality determination differs among 
these mechanisms. Disorders of tyR asymmetry can cause 
randomization (situs ambiguus) or L/R reversal of organ posi¬ 
tion (situs inversus). These defects may be limited to a single 
organ (as with a right-sided heart, or dextrocardia), or they may 
include many organs with L7R asymmetry (e.g., stomach and 
spleen). Midline defects (e.g., cleft palate, neural tube defects) 
are observed in nearly half of all persons with disorders of L/R 
asymmetry. 

Initially, establishment of L/R asymmetry requires a mecha¬ 
nism that generates asymmetry. The mechanism(s) by which 
symmetry is "broken" is unknown and the topic of much study 
and debate. In fish, amphibians, mice, and humans, motile cilia 
on the surface of cells of a left-right organizer (LRO) at the 
posterior end of the notochord create a leftward flow of fluid 
in the extracellular space into which it appears that morpho- 
gens involved in symmetry breaking are released. Flow is 
sensed by cells bordering the LRO and these cells express the 
morphogens. Nodal and Coco, an inhibitor of Nodal. Because 
of the leftward flow of fluid. Coco becomes down-regulated, 
leading to the accumulation of Nodal. Nodal is cleaved into its 
active form by a protease encoded by the gene, PCSK6, and 
Nodal signaling is transduced through the transmembrane 
receptor encoded by ACVR2B. Nodal signaling initiates a 
cascade of gene expression in the lateral plate mesoderm of 
its own inhibitor. Lefty, as well as the homeobox transcription 
factor, Pitx2. The mechanism by which symmetry is broken in 
other model organisms such as chicken appear to be different 
(Fig. 10-9). 

The function of these cilia partly depends on the expression 
of two proteins, left-right dynein (Ird) and polycystin-2. Dynein 
abnormalities in humans cause a group of disorders called 


primary ciliary dyskinesias, in which most individuals have situs 
inversus. Abnormal ciliary function is also associated with 
recurrent sinusitis, infertility, and hydrocephalus. Mutations in 
PKD1, the gene encoding polycystin-2, produce laterality 
defects in mice and autosomal dominant polycystic kidney 
disease in humans (see Chapter 4). 

Although more than 75 genes are known to be required for 
normal L/R development in model organisms, mutations in only 
a few of these have been found to cause laterality defects in 
humans. Mutations in zinc-finger protein of the cerebellum 
(ZIC3), a member of the Gli transcription factor family located 
on the X chromosome, are the most common known genetic 
cause of human laterality defects. Affected males exhibit ran¬ 
domization defects, and some carrier females have L/R rever¬ 
sal. In Drosophila, some members of the Gli family are known 
to be regulated by forming a complex with costal2, a motor 
molecule similar to dynein. This could explain how mutations 
in genes encoding dissimilar proteins could cause both human 
laterality disorders. Other genes that cause laterality defects 
include LEF-TYA, CRYPTIC, and ACVR2B. 

Once L/R asymmetry has been established in the early 
embryo, the left and right sides of individual organs must also 
be patterned. For example, two related transcription factors, 
dHAND and eHAND, play roles in patterning the right and left 
ventricles of the heart. In mice, homozygous mutation of 
dHAND produces animals that fail to form a right ventricle, 
indicating that dHAND participates in cardiac differentiation. 

Abnormalities of L/R asymmetry are found more often in 
human conjoined twins than in singletons or dizygotic twins. 
Most commonly, it is the twin arising on the right side that 
exhibits randomization of the L/R information. It has been sug¬ 
gested that randomization of the right-sided twin is caused by 
inadequate signaling from the left-sided embryo. A candidate 
for this signaling molecule, discovered in frogs, is Vgl. This 
suggests a possible molecular pathway for the formation of 
birth defects in human conjoined twins. 



FIG 10-9 Abnormalities of left/right (L/R) asymmetry in humans. A, The normal L/R positions of 
organs arranged along midline (situs solitus). The apex of the heart points toward the left. The 
right lung is trilobed and the left lung is bilobed. In the abdominal cavity, the spleen and stomach 
are positioned on the left side and the liver is on the right. The small bowel is looped in a coun¬ 
terclockwise direction. B, A complete mirror image of organ arrangement along the midline is 
called situs inversus. Persons with situs inversus might have no symptoms. C, Randomization 
of the arrangement of the heart, lungs, liver, spleen, and stomach along the midline (situs 
ambiguus, or heterotaxy). Situs ambiguus is often associated with congenital heart defects. 
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degenerates). This mesenchyme becomes the muscles and 
bones of the neck. The fate of each group of neural crest cells 
is specified by Hox genes. For example, functional inactiva¬ 
tion of Hoxa3 results in mice with small or absent thymuses 
and thyroid and parathyroid glands, as well as malformations 
of the heart and major blood vessels. Although the number 
of neural crest cells in these mice is normal, they lack fate 
information and thus fail to proliferate and differentiate. 
These defects are similar to those found in children with dele¬ 
tions of chromosome 22qll (see Chapter 6). 

The bones of the skull develop directly from mesenchyme 
produced by neural crest cells. Complete fusion of these 
bones usually does not occur until adulthood. Premature 
fusion (synostosis) of the skull bones (craniosynostosis) 
causes the head to be misshapen and can impair brain growth. 
Often, craniosynostosis is associated with additional birth 
defects (e.g., hearing loss). Many of the craniosynostosis syn¬ 
dromes are caused by mutations in FGFR genes (see Clinical 
Commentary 10-1). Craniosynostosis also can be caused by 
mutations in MSX2, a transcription factor that may play a 
role in controlling the programmed death of neural crest cells 
in the skull. 

Craniosynostosis is also a feature of Greig cephalopolysyn- 
dactyly, a disorder caused by mutations of the gene encoding 
GL13, a zinc-finger transcription factor. GLI3 encodes at least 
seven conserved domains, including DNA-binding, zinc- 
finger, and microtubular anchor domains. Studies of the Dro¬ 
sophila homolog of GLI3 suggest that this gene may be 
regulated so that it can have either activator or repressor 
functions. Mutations that cause Greig cephalopolysyndactyly 
occur in the carboxy-terminal portion of GLI3, eliminating 
both its activator and repressor functions. Mutations in the 
region between the zinc-finger and the microtubular anchor 
domains produce a protein in which the amino terminal is 
cleaved so that it can migrate to the nucleus and repress 
transcription. Such mutations in GLI3 cause a disorder called 
Pallister-Hall syndrome, characterized by hypothalamic 
hamartomas, visceral anomalies, and posterior polydactyly. 
Mutations 3' of the microtubular anchor domain produce a 
protein that retains both repressor and activator functions. 
These mutations have been described in persons with isolated 
posterior polydactyly, a relatively minor birth defect. There¬ 
fore, mutations in GLI3 alter the balance between its activator 
and repressor functions and cause three distinct disorders of 
varying severity. In addition, mutations that cause loss of 
function of a protein that acts as a cofactor of Gli proteins, 
GREBBP, cause Rubenstein-Taybi syndrome, a disorder char¬ 
acterized by intellectual disability, distinctive facial features, 
and broad thumbs (see Fig. 10-2). 

The majority of craniofacial structures are 
derived from neural crest cells. The fate of 
each group of neural crest cells is specified by 
homeobox-containing genes. Some of the 
genes controlling craniofacial development 
have been isolated by analysis of craniosynos¬ 
tosis syndromes. 


Development of the Limb 

The limb is our best-understood classic model of develop¬ 
ment. Surgical manipulation, ectopic gene expression, and 
targeted disruption of genes in animal models (see Box 10-1) 
have led to the isolation and characterization of many of the 
genes controlling growth and patterning of the limb. Many 
of the signaling pathways and transcriptional control ele¬ 
ments that coordinate limb development in model organisms 
such as Drosophila and chick appear to be conserved in 
mammals. Because the newborn prevalence of limb defects is 
second only to that of congenital heart defects, the limb defect 
phenotypes are well documented. As a result, our knowledge 
of the molecular basis of human limb defects has expanded 
rapidly. Flowever, the exact mediators and mechanisms of 
growth and patterning remain controversial and appear to 
differ between the most popular animal models (e.g., chick 
and mouse) studied. 

The vertebrate limb is composed of elements derived 
from lateral plate mesoderm (bone, cartilage, and tendons) 
and Semitic mesoderm (muscle, nerve, and vasculature). 
Limb bud formation involves multiple steps including speci¬ 
fication of where a limb will form along the anterior- 
posterior axis and induction of outgrowth. There is substantial 
evidence that specification along the anterior-posterior axis 
is controlled by the expression of FIox genes in the paraxial 
mesoderm. The next step in the formation of a limb is its 
induction. The signal that initiates induction is not clear, 
although many genes have been implicated. Expression of the 
genes Tbx5 and Tbx4 are essential for forelimb and hindlimb 
development, respectively. Tbx4 and Tbx5 are members of a 
highly conserved family of DNA transcription factors con¬ 
taining a DNA-binding domain called a T-box. Tbx5 and 
Tbx4 proteins regulate the expression of PgflO, which is 
essential for limb bud initiation, in the lateral plate meso¬ 
derm. FgflO activates Wnt/FgfS signals in the overlying ecto¬ 
derm to induce the formation of a specialized region called 
the apical ectodermal ridge (AER), which extends from ante¬ 
rior to posterior along the dorsal/ventral boundary of the 
limb bud (Fig. 10-10). 

Once initiated, proximal/distal growth of the limb bud is 
dependent on the AER. Before AER differentiation, two genes. 
Radical fringe (r-Fng) and Wnt7a, are expressed in the dorsal 
ectoderm. In the ventral ectoderm, expression of r-Fng 
and Wnt7a is blocked by Engrailed-1 (En-l), a homeobox- 
containing transcription factor. Expression of Wnt7a instructs 
the mesoderm to adopt dorsal characteristics. Mesoderm in 
which Wnt7a expression is blocked becomes ventralized. 
Thus, the processes of AER formation and dorsal/ventral pat¬ 
terning are interconnected and coordinated by En- 1. In the 
mouse, functional inactivation of Wnt7a results in ventraliza- 
tion of the dorsal surface (i.e., pads on both sides of the foot). 
Ventralization of the dorsal surface of the limb has been 
described in humans, but the etiology remains unknown. 

Mediation of proximal/distal growth by the AER is con¬ 
trolled in part by FGFs (e.g., FGF2, EGEA, FGE8) that stimu¬ 
late proliferation of an underlying population of mesodermal 
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cells in the progress zone (PZ). Maintenance of the AER is 
dependent on a signal from a posterior portion of the limb 
bud known as the zone of polarizing activity (ZPA). The sig¬ 
naling molecule of the ZPA is Shh, which is also responsible 
for dorsal/ventral patterning of the central nervous system 
and for establishing the embryonic left/right axis. The ZPA 
also specifies positional information along the anterior/ 
posterior axis of the limb bud. 

Defects of the anterior and posterior elements of the 
upper limb occur in the Holt-Oram syndrome and ulnar- 

Dorsal 



FIG 10-10 Schematic illustration of a limb bud. The apical 
ectodermal ridge (AER) extends from anterior to posterior 
along the dorsal/ventral boundary of the limb bud. Proximal 
to the AER is a region of rapidly proliferating mesodermal 
cells called the progress zone (PZ). Located in the posterior 
mesoderm is an important signaling center called the zone 
of polarizing activity (ZPA). The signaling pathways of the AER, 
PZ, and ZPA are interconnected so that pattern formation and 
growth partly depend on their coordinated function. 


mammary syndrome, respectively (Fig. 10-11). Holt-Oram 
syndrome is caused by mutations in the gene TBX5, and 
ulnar-mammary syndrome is caused by mutations in the 
tightly linked gene TBX3. TBX3 and TBX5 evolved from a 
common ancestral gene, and each has acquired specific yet 
complementary roles in patterning the anterior/posterior axis 
of the mammalian upper limb. TBX3 and TBX5 also play 
roles in the development of many other organs. For example, 
persons with Holt-Oram syndrome also have congenital 
heart defects, most commonly an atrial septal defect that 
allows blood in the left and right atria to mix. TBX5 interacts 
with another transcription factor, Nkx2-5, during heart 
development. Mutations in the gene encoding Nkx2-5 also 
cause atrial septal defects. Thus, perturbation of two different 
mediators in the same developmental program can produce 
the same type of birth defect. 

If the early events of limb bud signaling provide positional 
information to developing cells, what controls the growth 
and differentiation of these cells? One important component 
is the transcription factors encoded by Hox genes. The expres¬ 
sion patterns of Hoxa9 through Hoxa\3 define overlapping 
domains along the proximal/distal axis of the developing 
limb bud. Combinations of Hox paralogs preferentially 
promote growth within different segments of the limb 
according to their 5' position within a Hox cluster. For 
example, mice with Hoxall or Hoxdll mutations have only 
minor abnormalities, but Hoxall/Hoxdll double mutants 
exhibit a marked reduction in the size of the radius and ulna. 
Similarly, deletion of increasing numbers of Hox- 13 paralogs 
has a cumulative effect on phenotypic abnormalities of the 
hands or feet, presumably because the functions of Hox para¬ 
logs are partially redundant. 

Because of this redundancy, it was suspected that muta¬ 
tions in Hox genes would be unlikely causes of human birth 
defects. However, to date, mutations in at least 10 HOX genes 
have been found in individuals with birth defects. For 



FIG 10-11 A, Absence of the right third, fourth, and fifth digits (i.e., posterior digits) accompanied 
by aplasia of the ulna and hypoplasia of the radius in a patient with ulnar-mammary syndrome. 
The fifth digit of the right hand is also missing. B, Bilateral absence of the thumb (i.e., an anterior 
digit) and radius with marked hypoplasia of the humerus in an adult with Holt-Oram syndrome. 
(From Jones KL. Smith's Recognizable Patterns of Human Malformation. 6th ed. Philadelphia: 
Mosby: 2006.) 
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example, HOX mutations have been described in persons 
with synpolydactyly and with hand-foot-genital syndrome. 
Synpolydactyly is characterized by duplication and fusion of 
the middle digits of the hands and feet. It is caused by muta¬ 
tions in HOXD13 that produce an expansion of a polyalanine 
tract in the amino-terminal end of the HOXD13 protein. A 
similar defect can be reproduced by simultaneous disruption 
of Hoxdl3, Hoxdll, and HoxdW, suggesting that expansion 
of the polyalanine tract in HOXD13 results in the functional 
inactivation of its 3' neighbors. 

The vertebrate limb is composed of elements 
derived from lateral plate mesoderm and 
Semitic mesoderm. Growth and patterning are 
controlled by proteins secreted from special¬ 
ized collections of cells called the apical ecto¬ 
dermal ridge, the progress zone, and the zone 
of polarizing activity. 

Organ Formation 

Many developmental processes must be coordinated simulta¬ 
neously to construct the specific arrangement of cells and 
tissues that compose an organ. As in limb development, the 
formation of organs involves numerous interactions. These 
interactions are mediated by secreted signaling molecules 
that bind to receptors, conduct signals through various inter¬ 
connected pathways, and stimulate or repress DNA transcrip¬ 
tion. Use of the same elaborate networks to form different 
organs allows genomic economy while maintaining develop¬ 
mental flexibility. 

Once a specialized cell within an organ is terminally dif¬ 
ferentiated, various proteins turn on its molecular machinery 
so that it may perform its intended function. Often, develop¬ 
ment of the organ and function of the differentiated cell are 
interrelated. For example, the endocrine pancreas is largely 
composed of three different cell types: alpha, beta, and 
gamma. Transcription of insulin in beta cells is stimulated by 
the binding of insulin promoter factor 1 (IPFl) to the insulin 
promoter region. Mutations in the gene encoding IPFl 
prevent pancreatic development, indicating that IPFl is nec¬ 
essary for the maturation and differentiation of pancreatic 
precursor cells. 

Interactions between mesenchymal and epithelial cells are 
prominent in the development of cutaneous structures (e.g.. 


hair, sweat glands, breasts), parenchymal organs (e.g., liver, 
pancreas), lungs, thyroid, kidneys, and teeth. These interac¬ 
tions are dynamic in that expression patterns in the epithelia 
and mesenchyme change over time and continue to influence 
one another. For example, during tooth development, the 
epithelium secretes Bmp-4, which signals the underlying mes¬ 
enchyme to express a set of transcription factors, including 
Msxl. The mutual exchange of signals between epithelium 
and mesenchyme leads to formation of a dental papilla and 
cusp and finally to terminal differentiation of mesenchyme 
into tooth-forming odontoblasts. In humans, mutations in 
MSXl disrupt tooth formation and cause loss of the second 
premolars. Similarly, mutations in the human homolog of the 
mouse gene hairless cause loss of all body hair, including that 
of the scalp, eyebrows, axillae, and pubis. 

The integrity of signals exchanged between the epithelium 
and the mesenchyme is dependent on the integrity of these 
tissues. Several proteins produced within the epithelium are 
known to promote the growth and differentiation of the epi¬ 
thelium. One of these proteins is p63, a homolog of the pro- 
totypic tumor suppressor gene product, p53. Mutations that 
perturb the function of p63 decrease the availability of epi¬ 
thelial progenitor cells. This results in abnormalities of the 
limbs, skin, teeth, hair, and nails in at least six different mal¬ 
formation syndromes. 

One of the largest organs in the body is the skeleton. Skel¬ 
etal formation is dependent on bone-forming cells called 
osteoblasts. The differentiation of osteoblasts is regulated by 
an osteoblast-specific transcription factor, Runx2. Targeted 
disruption of Runx2 produces mice with a complete lack of 
ossification of the skeleton. Heterozygous mice have widened 
cranial sutures, shortened digits, and abnormalities of the 
shoulder girdle. Similar defects are found in persons with 
cleidocranial dysplasia, which is caused by mutations in 
Runxl, the human homolog of mouse Runx2. 

Formation of organs involves reciprocal inter¬ 
actions between epithelium and mesenchyme. 
This interaction is mediated by secreted sig¬ 
naling molecules that bind to receptors, 
conduct signals through various intercon¬ 
nected pathways, and stimulate or repress 
DNA transcription. 




I STUDY QUESTIONS 


1. Explain how nonhuman animal models are useful for 
studying human development and birth defects. Give at 
least one example. 

2. Mutations in fibroblast growth factor receptors (FGFRs) 
cause at least six different craniosynostosis syndromes. 
Moreover, the same mutation in PGFR2 causes Pfeiffer 
syndrome in some families but Crouzon syndrome in 
other families. How can the same mutation produce two 
distinct disorders? 


3. Disorders caused by mutations in genes encoding tran¬ 
scription factors are often pleiotropic. Explain this 
finding. 

4. Define pattern formation, and give an example of a birth 
defect caused by disruption of this process. 

5. If the control of developmental processes is tightly 
regulated, how can you explain that mutations in some 
developmental genes (e.g., Hox genes) produce subtle 
phenotypes? 
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6. Loss-of-function mutants in mammals are commonly 
studied by creating a knockout mouse model. Explain why 
it is impractical to use some organisms (e.g., baboons) to 
generate knockouts. Can you think of a way to circumvent 
some of these obstacles? 


7. Give an example of a birth defect that can be caused by a 
defect in a ligand or its receptor. 

8. Explain what may be some of the obstacles to using gene 
therapy to treat birth defects. 
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Cancer Genetics 


Present evidence indicates that approximately one in four 
deaths is now due to cancer and that invasive cancer will be 
diagnosed in more than half of the population at some point 
in their lives. Many cancers are increasing in incidence, largely 
as a result of the increased average age of our population. 

As shown in this chapter, the causes of cancer are a mixture 
of environmental and genetic alterations that occur in our 
tissues. Inherited predisposition plays a role in some families. 
Dramatic advances in molecular biology and genetics have 
now clarified the basic molecular elements of cancer and 
provide a schematic outline of the cellular events leading to 
cancer. This understanding will be crucially important in the 
control of cancer, providing the beginnings of a base of 
knowledge that should lead to significantly improved thera¬ 
pies and preventive strategies. 

“Cancer” is a collection of disorders that share the common 
feature of uncontrolled cell growth. This leads to a mass of 
cells termed a neoplasm (Greek, “new formation”), or tumor. 
The formation of tumors is called tumorigenesis. Several key 
events must occur if cells are to escape the usual constraints 
that prevent uncontrolled proliferation. Additional growth 
signals must be produced and processed, and cells must 
become resistant to signals that normally inhibit growth. 
Because these abnormal characteristics would typically 
trigger the process of programmed cell death (apoptosis), 
cells must somehow disable this process. The growing cell 
mass (tumor) requires nourishment, so a new blood supply 
must be obtained through angiogenesis (the formation of 
new blood vessels). Additional inhibitory signals must be 
overcome for the tumor to achieve a malignant state, in 
which neoplasms invade nearby tissues and metastasize 
(spread) to more distant sites in the body. The capacity to 
invade and metastasize distinguishes malignant from benign 
neoplasms. 

Tumors are classified according to the tissue type in which 
they arise. Major types of tumors include those of epithelial 
tissue (carcinomas, the most common tumors), connective 
tissue (sarcomas), lymphatic tissue (lymphomas), glial cells 
of the central nervous system (gliomas), and hematopoietic 
organs (leukemias). The cells that compose a tumor are 
usually derived from a single ancestral cell, making them a 
single clone (monoclonal). 

Many of the basic biological features of carcinogenesis 
(cancer development) are now understood. Throughout our 
lives, many of our cells continue to grow and differentiate. 
These cells form, for example, the epithelial layers of our 


lungs and colons and the precursor cells of our immune 
systems. Relatively undifferentiated stem cells produce large 
numbers of progeny cells to repopulate and renew our worn 
defensive layers. Through integration of information pro¬ 
vided by a complex array of biochemical signals, the new cells 
eventually stop dividing and terminally differentiate into a 
cell type appropriate for their role in the body (Fig. 11-1). 
Alternatively, if the cell is abnormal or damaged, it may 
undergo apoptosis. 

Occasionally one of these cells fails to differentiate and 
begins to divide without restraint. The descendants of such 
cells can become the founders of neoplasms, capable of 
further transformation into invasive, metastatic cancer. We 
wish to understand in detail what has gone wrong in these 
cells, to detect them early, and ultimately to intervene in their 
development so as to eliminate them. 

Cells in the body are programmed to develop, 
grow, differentiate, and die in response to 
a complex system of biochemical signals. 
Cancer results from the emergence of a clone 
of cells freed of these developmental program¬ 
ming constraints and capable of inappropriate 
proliferation. 

CAUSES OF CANCER 
Genetic Considerations 

Genetic alterations of cell regulatory systems are the primary 
basis of carcinogenesis. We can create cancer in animal 
models by damaging specific genes. In cell culture systems, 
we can reverse a cancer phenotype by introducing normal 
copies of the damaged genes into the cell. Most of the genetic 
events that cause cancer occur in somatic cells. The frequency 
of these events can be altered by exposure to mutagens, thus 
establishing a link to environmental carcinogens (cancer- 
causing agents). However, these genetic events are not trans¬ 
mitted to future generations because they occur in somatic, 
rather than germline, cells. Even though they are genetic 
events, they are not inherited. 

It is also possible for cancer-predisposing mutations to 
occur in germline cells. This results in the transmission of 
cancer-causing alleles from one generation to the next, pro¬ 
ducing families that have a high incidence of specific cancers 
(Fig. 11-2). Such “cancer families,” although rare, demon¬ 
strate that inheritance of a damaged gene can cause cancer. 
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In these families, inheritance of one mutant allele seems suf¬ 
ficient to cause a specific form of cancer. Most individuals 
who inherit the mutant allele will develop a tumor. This is 
because each of their cells now carries the altered gene and 
thus has already taken the first step down the cancer pathway. 
The childhood cancer of the eye, retinoblastoma, is a good 
example. As discussed in Chapter 4, those who inherit a 
mutant version of the retinoblastoma gene have approxi¬ 
mately a 90% chance of developing one or more retinoblas¬ 
toma tumors. 

Although the transmission of cancer as a single-gene dis¬ 
order is relatively uncommon, there is good evidence for 
more frequent clustering of some cancer types in families. For 
many kinds of cancer, such as those of the breast and colon, 
the diagnosis of the cancer in a first-degree relative implies at 
least a twofold increase in one’s risk of developing the cancer. 
It is very likely that the inheritance of altered forms of specific 
genes is at least partly responsible for this increased risk. 

The extent to which each of these mechanisms—inherited 
germline mutations versus mutations occurring in somatic 
cells—contributes to human cancer is an important question. 
If inherited predispositions are significant determinants of 
a person’s risk of acquiring a specific form of cancer, it 
should be possible to identify those in whom the risk is ele¬ 
vated. More intensive screening of defined high-risk groups 
can result in early detection and intervention, leading to 
better prognoses for patients and lowered morbidity and 
mortality. 


Signals 

Growth factors 
Steroid hormones 
Cell-cell interactions 


Differentiation 

Growth and cell 
division 


Cell death 


FIG 11-1 In response to environmental signals, a cell may 
continue to divide or it may differentiate or die (apoptosis). 



The basic cause of cancer is damage to spe¬ 
cific genes. Usually, mutations in these genes 
accumulate in somatic cells over the years, 
until a cell accumulates a sufficient number of 
errors to initiate a tumor. If damage occurs in 
cells of the germline, however, an altered 
form of one of these genes can be transmitted 
to progeny and predispose them to cancer. 
The increased risk of cancer in such persons 
is due to the fact that each of their cells now 
carries the first step in a multistep cancer 
pathway. 


Environmental Considerations 

What is the role of the nongenetic environment in carcino¬ 
genesis? At the level of the cell, cancer seems intrinsically 
genetic. Tumor cells arise when certain changes, or muta¬ 
tions, occur in genes that are responsible for regulating the 
cell’s growth. However, the frequency and consequences of 
these mutations can be altered by a large number of environ¬ 
mental factors. It is well documented, for example, that many 
chemicals that cause mutation in experimental animals also 
cause cancer and are thus carcinogens. Furthermore, other 
environmental agents can enhance the growth of genetically 
altered cells without directly causing new mutations. Thus, it 
is often the interaction of genes with the environment that 
determines carcinogenesis; both play key roles in this process. 

Two additional lines of argument support the idea that 
exposure to environmental agents can significantly alter a 
person’s risk of cancer. The first is that a number of environ¬ 
mental agents with carcinogenic properties have been identi¬ 
fied. For example, epidemiological studies and laboratory 
experiments have shown that cigarette smoke causes lung and 
other types of cancer. Roles for other environmental agents 
in specific cancers are also well documented (e.g., uranium 
dust in lung cancer among miners, asbestos exposure in lung 
cancer and mesothelioma). 

The second line of argument is based on epidemiological 
comparisons among populations with differing lifestyles. 


II 

III 

IV 


6b6i-rO 6 










¥1 



V 


FIG 11-2 A familial colorectal cancer pedigree. Darkened symbols represent individuals with 
diagnosed colorectal cancer. 
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Many kinds of cancer have quite different frequencies in dif¬ 
ferent populations. Breast cancer, for example, is prevalent 
among northern Europeans and European Americans but 
relatively rare among women in developing countries. It is 
usually difficult to determine whether such dissimilarities 
reflect differences in lifestyle or in gene frequencies. 

Examination of genetically similar populations under dif¬ 
fering lifestyles, however, provides an opportunity to evalu¬ 
ate the genetic and environmental components of cancer. 
Epidemiological studies among migrant Japanese popula¬ 
tions have yielded important findings with respect to colon 
cancer. This type of cancer was, until recently, relatively rare 
in the Japanese population living in Japan, with a lifetime 
risk of 0.5%, but it is 10 times more common in the United 
States. Stomach cancer, on the other hand, is common in 
Japan but relatively rare in the United States. These statistics 
in themselves cannot distinguish environmental from genetic 
influences in the two populations. However, because large 
numbers of Japanese have immigrated, first to Hawaii and 
then to the U.S. mainland, we can observe what happens to 
the rates of stomach and colon cancer among the immi¬ 
grants. It is important to note that many of the Japanese 
immigrants maintained a genetic identity, marrying largely 
among themselves. Among the first-generation Japanese in 
Hawaii, the incidence of colon cancer rose several-fold—not 
yet as high as in the U.S. mainland, but higher than in Japan. 
Among second-generation Japanese Americans on the U.S. 
mainland, the colon cancer rate rose to 5%, equal to the U.S. 
average. At the same time, stomach cancer has become rela¬ 
tively rare among Japanese Americans. 

These observations strongly suggest an important role for 
environment or lifestyle in the etiology of colon cancer. In 
each case, diet is a likely culprit: a high-fat, low-fiber diet in 
the United States is thought to increase the risk of colon 
cancer, whereas techniques used to preserve and season the 
fish commonly eaten in Japan are thought to increase the 
risk of stomach cancer. It is also interesting that the inci¬ 
dence of colon cancer in Japan has increased dramatically 
during the past several decades, as the Japanese population 
has adopted a diet more similar to that of North America 
and Europe. 

Are we then to assume that genetic factors play no role in 
colon cancer? The fact remains that in the North American 
environment some people will get colon cancer but most will 
not. This distinction can result from differences within this 
environment (e.g., dietary variation) as well as differences in 
genetic predisposition: inherited genes that increase a per¬ 
son’s probability of developing cancer. To account for the 
difference in colon cancer incidence between Japanese living 
in the U.S. and those living in Japan, it is argued that envi¬ 
ronmental features in Japan render the predisposing 
genes less penetrant. Furthermore, a genetic component 
is strongly suggested by the several-fold increase in risk 
to a person when a first-degree relative has colon cancer. 
It is likely, then, that cancer risk is a composite of both genet¬ 
ics and environment, with interaction between the two 
components. 


Environmental factors are known to play 
important roles in carcinogenesis. However, a 
person's overall cancer risk depends on a com¬ 
bination of inherited factors and environmen¬ 
tal components. 

CANCER GENES _ 

Genetic Control of Cell Growth 
and Differentiation 

Cancers form when a clone of cells loses the normal controls 
over growth and differentiation. More than 100 cancer- 
causing genes that encode proteins participating in this 
regulation have now been identified. Characterization of 
the biochemical activities and interactions of these gene 
products has revealed an increasingly detailed picture of the 
normal regulation of cell growth and differentiation and 
the ways these processes become deregulated by the events of 
carcinogenesis. 

Many features of this fundamental process are now well 
understood (Fig. 11-3). One component of cell regulation is 
mediated by external signals coming to the cell through 
growth factors (e.g., platelet-derived growth factor, epider¬ 
mal growth factor, steroid hormones) produced in other cells. 
Each growth factor interacts with specific growth factor 


Growth factor 



FIG 11-3 The major features of cellular regulation. External 
growth factors (proteins and steroid hormones such as epi¬ 
dermal growth factor) bind to membrane-spanning growth 
factor receptors on the cell surface, activating signal- 
transduction pathways in which genes such as HAS partici¬ 
pate. Components of the signal-transduction pathway in turn 
interact with nuclear transcription factors, such as Myc and 
Eos, which can bind to regulatory regions in DMA. mRNA, 
Messenger RNA. 
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receptors located on the cell surface. Binding of a growth 
factor activates the receptor, triggering molecules that send 
messages to the cell’s nucleus in the process of signal trans¬ 
duction. These signal transduction molecules include protein 
kinases, such as Src tyrosine kinase, mitogen-activated 
protein kinase (MAPK), and Jun kinase (JunK), which can 
alter the activity of target proteins hy tagging them at a spe¬ 
cific site with a phosphate molecule (phosphorylation). The 
ultimate stage of the signal transduction pathway is regula¬ 
tion of DNA transcription in the nucleus. Components of the 
signal transduction cascade interact with nuclear transcrip¬ 
tion factors that regulate the activity of specific genes whose 
protein products influence cellular growth and proliferation. 
Genes that encode these transcription factors include MYC, 
FOS, and JUN. 

After several rounds of cell division, cells normally receive 
signals that tell them to stop growing and to differentiate into 
specialized cells. The signals may come from polypeptides, 
from steroid hormones, from direct contact with adjacent 
cells, or from internal programs that define the number of 
cell divisions that are allowed. The signals are transduced to 
the nucleus of the recipient cell. Here, hy altering the tran¬ 
scription patterns of genes that govern the steps of the cell 
cycle, they repress genes that promote cell division and induce 
genes that inhibit entry into the cell division cycle. 

The regulation of cell growth is accomplished 
by substances that include: (1) growth factors 
that transmit signals from one cell to another; 
(2) specific receptors for the growth factors; (3) 
signal transduction molecules that activate a 
cascade of phosphorylating reactions within 
the cell; and (4) nuclear transcription factors. 
The cell integrates and interprets the host of 
signals it receives from its environment. Deci¬ 
sions to grow and divide, or to stop growing 
and differentiate, result from processing of 
these signals. 

A cancer cell can emerge from within a population of 
growing cells through the accumulation of mutations in 
these genes. Although such mutations occur only rarely, 
these cells can fail to respond to differentiation signals 
and continue to divide instead of undergoing their normal 
differentiation program. Furthermore, cancers seem usually 
to result from a progressive series of events that incre¬ 
mentally increase the extent of deregulation within a cell 
lineage. Eventually, a cell emerges whose descendants multi¬ 
ply without appropriate restraints. Further changes give 
these cells the capacity to invade adjacent tissues and form 
metastases. Each of these changes involves mutations, and 
the requirement for more than one mutation has been char¬ 
acterized as the multi-hit concept of carcinogenesis. An 
example of this concept is given by colorectal cancer, in 
which several genetic events are required to complete the 
progression from a benign growth to a malignant neoplasm 
(see later discussion). 


Mutations can occur in any of the steps 
involved in regulation of cell growth and dif¬ 
ferentiation. Accumulation of such mutations 
within a cell lineage can result in a progressive 
deregulation of growth, eventually producing 
a tumor cell. 

The Inherited Cancer Gene versus the 
Somatically Altered Gene 

Although “cancer families” have long been recognized, it was 
not until the early 1970s that we began to understand the 
relationship between the inherited genetic aberrations and 
the carcinogenic events that occur in somatic tissue. In 1971, 
Alfred Knudson’s analysis of retinoblastoma, a disease 
already mentioned as a model of inherited cancer, led him to 
a hypothesis that opened a new window into the mechanism 
of carcinogenesis. In the inherited form of retinoblastoma 
(see Chapter 4), an affected individual often has an affected 
parent, and there is a 50% chance of genetic transmission 
to each of the offspring. In the sporadic (noninherited) 
form, neither parent is affected, nor is there additional risk 
to other progeny. A key feature distinguishing the two forms 
is that inherited retinoblastoma is usually bilateral (affecting 
both eyes), whereas sporadic retinoblastoma usually involves 
only a single tumor and therefore affects only one eye 
(unilateral). 

Knudson reasoned that at least two mutations may be 
required to create a retinoblastoma. One of the mutations 
would alter the retinoblastoma gene; if this happened in the 
germline, it would be present in all cells of a child who 
received the mutant allele. The second mutation would be an 
additional, unspecified genetic event occurring in an already- 
altered cell. The hypothesis of a second event was required to 
explain why only a tiny fraction of the retinoblasts of a person 
who has inherited a mutant retinoblastoma gene actually give 
rise to tumors. Knudson’s hypothesis is known as the two-hit 
model of carcinogenesis. 

Familial retinoblastoma would thus be caused by the 
inheritance of one of the genetic “hits” as a constitutional 
mutation (i.e., a mutation present in all cells of the body). 
Persons who inherited one hit would require only one addi¬ 
tional mutational event in a single retinoblast for that cell to 
seed a tumor clone. In sporadic cases, on the other hand, both 
mutations would have to occur somatically in the developing 
fetus (Fig. 11-4). This is a highly improbable combination of 
rare events, even considering the several million cells of the 
target tissue. The child who developed a retinoblastoma by 
this two-hit somatic route would be unlikely to develop more 
than one tumor. The child inheriting a mutant retinoblas¬ 
toma gene, however, would need only a single, additional 
genetic hit in a retinoblast for a tumor clone to develop. 
Knudson argued that such an event was likely to occur in 
several of the retinoblasts of each carrier of the inherited 
mutant gene, thus explaining the bilaterality of inherited 
retinoblastoma. 

If the first event in the two-hit model can be an inherited 
mutation, what is the nature of the second hit? Extensive 
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FIG 11-4 A, Persons inheriting an RBI mutation are heterozygous for the mutation in all cells 
of their body. The second hit occurs during embryonic development and typically affects more 
than one retinoblast, resulting in multiple tumors. B, In somatic retinoblastoma, both copies of 
the RBI gene must be disabled in the same retinoblast to cause tumor formation. Each process 
leads to homozygosity for the mutant RBI allele and thus to tumor development. 


molecular analysis of the region of chromosome 13 that con¬ 
tains the retinoblastoma-causing gene, RBI, showed that the 
second hit, like the first one, is a loss-of-function mutation. 
Several mechanisms, including point mutation, deletion, and 
hypermethylation of the RBI promoter region (associated 
with decreased transcription; see Chapters 3 and 5), can 
produce this effect. The second hit, which occurs in the fetus 
during the period in which retinoblasts are rapidly dividing 
and proliferating, has removed the remaining normal allele 
of this gene. This implies that a cell with one mutant RBI 
allele and one normal RBI allele cannot form a tumor. Thus, 
the product of the normal gene, even when present only in a 
single copy, prevents tumor formation. 

An important corollary of this two-hit hypothesis is that 
the genes in which inherited mutations cause familial cancer 
syndromes may be the same as those that generate common 
cancers by somatic mutation. By understanding the nature of 
the mutant alleles inherited in rare cancer families, therefore, 
we will come to understand more about the somatic pathway to 
common cancer as well. Indeed, somatic loss-of-function 
mutations of both copies of the RBI gene are seen frequently 
in many types of tumors, including small-cell lung carci¬ 
noma, breast carcinoma, glioblastoma (a brain tumor), and 
osteosarcoma. 


Alfred Knudson's two-hit theory of carcinogen¬ 
esis in retinoblastoma became the paradigm 
for a model to describe how inheritance of an 
altered gene predisposes the gene carrier to 
cancer. The theory states that a cell can initiate 
a tumor only when it contains two damaged 
alleles; therefore, a person who inherits one 
copy of a mutant retinoblastoma gene must 
experience a second, somatic mutation in one 
or more retinoblasts in order to develop one 
or more retinoblastomas. Two somatic muta¬ 
tions can also occur in a single retinoblast of 
a nonpredisposed fetus, producing sporadic 
retinoblastoma. Understanding mutated genes 
that are inherited in families can increase our 
understanding of the somatic pathway to 
common cancers. 

MAJOR CLASSES OF CANCER GENES 

Cancer-causing genes can be classified into three major cat¬ 
egories: those that normally inhibit cellular proliferation 
(tumor suppressors), those that activate proliferation (onco¬ 
genes), and those that participate in DNA repair. 
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TABLE 11-1 Examples of Tumor Suppressor Genes and DNA Repair Genes and Their Roles 
in Inherited Cancer 

GENE (RELATED GENES 


DISEASE CAUSED BY GERMLINE 

IN PARENTHESES) 


FUNCTION OF GENE PRODUCT 

MUTATIONS 

Tumor Suppressor 

RBI (pi 07, pi 30) 

Genes 

Cell cycle brake; binds to E2F transcription factor 

Retinoblastoma; osteosarcoma 

ARC 


complex 

Interacts with p-catenln in Wnt signaling pathway 

Familial adenomatous polyposis 

SMAD4 


Transmits signals from TGF(3 

Juvenile polyposis 

NF1 


Down-regulates RAS protein 

Neurofibromatosis type 1 

NF2 


Cytoskeletal protein regulation 

Neurofibromatosis type 2 

TP53 


Transcription factor; induces cell cycle arrest or 

Li-Fraumeni syndrome 

VHL 


apoptosis 

Regulates multiple proteins, including p53 and NFkB 

Von Hippel-Lindau disease (renal cysts 

WT1 


Zinc finger transcription factor; binds to epidermal 

and cancer) 

Wilms tumor 

CDKN2A (pi4, pi6) 


growth factor gene 

CDK4 inhibitor 

Familial melanoma 

PTEN 


Phosphatase that regulates PI3K signaling pathway 

Cowden syndrome (breast and thyroid 

CHEK2 


Phosphorylates p53 and BRCA1 

cancer) 

Li-Fraumeni syndrome 

PTCH 


Sonic hedgehog receptor 

Gorlin syndrome (basal cell carcinoma, 

CD HI 


E-cadherin; regulates cell-cell adhesion 

medulloblastoma) 

Gastric carcinoma 

DPC4 


Transduces transforming growth factor-p signals 

Juvenile polyposis 

TSC2 


Down-regulates mTOR (mammalian target of 

Tuberous sclerosis 



rapamycin) 


DNA Repair Genes 

MLH1 


DNA mismatch repair 

HNPCC 

MSH2 


DNA mismatch repair 

HNPCC 

BRCA1 


Interacts with BRCA2/RAD51 DNA repair protein 

Familial breast and ovarian cancer 

BRCA2 


complex 

Interacts with RAD51 DNA repair protein 

Familial breast and ovarian cancer 

ATM 


Protein kinase; phosphorylates BRCA1 in response 

Ataxia telangiectasia; conflicting evidence 

XPA 


to DNA damage 

Nucleotide excision repair 

for direct involvement in breast cancer 
Xeroderma pigmentosum 


Tumor Suppressor Genes 

The RBI gene was the first identified example of a tumor 
suppressor gene, a class of genes that control cell division 
and thus help to prevent tumors (Table 11-1). A perplexing 
feature of tumor suppressor genes is that inherited mutations 
are dominant alleles at the level of the individual (i.e., hetero¬ 
zygotes usually develop the disease), but they are recessive 
alleles at the level of the cell (heterozygous cells do not form 
tumors). This apparent contradiction is resolved by realizing 
that in individuals who have inherited the first hit, a second 
hit that occurs in any one cell will cause a tumor. Because 
there are several million target retinoblasts in the developing 
fetus, heterozygous persons form, on the average, several reti¬ 
noblasts homozygous for an RBI mutation. Each of these can 
lead to a retinoblastoma. Thus, it is the strong predisposition 
to tumor formation (i.e., the first hit) that is inherited as an 
autosomal dominant trait. The incomplete penetrance of 
the retinoblastoma mutation (90%) is explained by the fact 


that some people who inherit the disease-causing mutation 
do not experience a second hit in any of their surviving 
retinoblasts. 

A general property of tumor suppressors is that they nor¬ 
mally block the uncontrolled cellular proliferation that can 
lead to cancer. Often, this is done by participating in pathways 
that regulate the cell cycle. For example, the protein encoded 
by RBI (pRb) is active when it is unphosphorylated but 
is down-regulated when it is phosphorylated by cyclin- 
dependent kinases (CDKs) just before the S phase of the cell 
cycle (see Chapter 2). In its active, hypophosphorylated state, 
pRb binds to members of the E2F transcription complex, 
inactivating them (Fig. 11-5). E2F activity is required for the 
cell’s progression into S phase, so its inactivation by pRb halts 
the cell cycle. pRb thus serves as a cell-cycle brake that is 
normally released only when pRb is inactivated through 
phosphorylation by CDKs. This allows the cell to proceed 
through its mitotic cycle until pRb is activated again through 
removal of the phosphate groups. A loss-of-function 
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of p53 


GAP2 

FIG 11-5 Regulation of the cell cycle Is accomplished by a 
complex series of interactions among activators and repres¬ 
sors of the cycle. pRb acts as a master brake on the cell cycle 
by binding the E2F transcription complex, halting the cycle 
before S phase begins. The cyclin D-CDK4 complex inacti¬ 
vates pRb by phosphorylating it, thereby releasing the E2F 
complex and allowing the cell to progress through S phase. 
CDK inhibitors such as pi 6 and p21 inactivate CDKs, thus 
acting as another brake on the cycle. p53, acting through p21, 
can either halt the cell cycle or Induce apoptosis in response 
to DNA damage. CDK, Cyclin-dependent kinase. 


Cell 
:osis cycle 


Synthesis 


/ 


mutation in RBI, a deletion of the gene, or hypermethylation 
of its 5' region can lead to its permanent inactivation. Without 
this brake on the cell cycle, the cell can proceed through 
numerous uncontrolled divisions. 

Loss-of-function mutations of other inhibitory factors can 
also lead to an unregulated cell cycle. A number of tumor 
suppressor genes encode CDK inhibitors (see Fig. 11-5), 
which inactivate CDKs and thus prevent them from phos¬ 
phorylating target proteins such as pRb. Tumor suppressors 
can also control cell proliferation through their effects on 
transcription or on cell-cell interactions (some examples are 
discussed later). Again, mutations in these genes can lead to 
unrestricted cell division and ultimately to cancer. 



The discovery that retinoblastoma results 
when both alleles of the same locus on chro¬ 
mosome 13 are inactivated in the same retino- 
blast led to the concept of tumor suppressor 
genes. The products of such genes suppress 
tumor formation by controlling cell growth 
and can do so even if a cell contains only one 
normal version of the gene. Loss-of-function 
mutations that inactivate both copies of a cell's 
tumor suppressor gene can lead to uncon¬ 
trolled cellular proliferation. 


Because of the pivotal role of tumor suppressors in pre¬ 
venting tumor formation, their study is of considerable 
medical significance. By understanding how cancer is natu¬ 
rally suppressed by the body, we can ultimately develop more 
effective medical therapies for tumor prevention and 
treatment. 


Oncogenes 

Oncogenes (i.e., “cancer genes”) are a second category of 
genes that can cause cancer. Most oncogenes originate from 
proto-oncogenes, which are genes involved in the four basic 
regulators of normal cell growth mentioned previously 
(growth factors, growth factor receptors, signal transduction 
molecules, and nuclear transcription factors). When a muta¬ 
tion occurs in a proto-oncogene, it can become an oncogene, 
a gene whose excessively active product can lead to unregu¬ 
lated cell growth and differentiation. When a cell proceeds 
from regulated to unregulated growth, the cell is said to have 
been transformed. 

Unlike tumor suppressor genes, oncogenes are usually 
dominant at the cellular level: only a single copy of a mutated 
oncogene is required to contribute to the multistep process 
of tumor progression. Whereas tumor suppressors are typi¬ 
cally disabled by deletions or loss-of-function mutations, 
oncogenes are typically activated by gain-of-function muta¬ 
tions, gene amplification (i.e., increased numbers of the gene 
through trisomy or other mechanisms), hypomethylation of 
the oncogene’s 5' region (which increases transcription), or 
chromosome rearrangements that upregulate the oncogene 
(e.g., the Philadelphia chromosome translocation. Chapter 
6). Most tumor suppressor genes are known to exhibit germ¬ 
line mutations that can cause inherited cancer syndromes 
(e.g., retinoblastoma, Li-Fraumeni syndrome). In contrast, 
although oncogenes are commonly found in sporadic tumors, 
germline oncogene mutations that cause inherited cancer 
syndromes are uncommon (a few exceptions are noted later 
in the discussion). These and other differences are summa¬ 
rized in Table 11-2. 

In this section, we review three approaches that have been 
used to identify specific oncogenes: retroviral definition, 
transfection experiments, and mapping in tumors. 



Proto-oncogenes encode products that control 
cell growth and differentiation. When mutated 
or amplified, they can become oncogenes, 
which can cause cancer. Most oncogenes act 
as dominant gain-of-function mutations that 
lead to the deregulation of cell cycle control. 
In contrast to tumor suppressor genes, most 
oncogenes do not exhibit germline muta¬ 
tions that cause inherited cancer syndromes. 
Instead, somatic mutations are seen that lead 
to sporadic cancers. 


It has long been known that certain types of viruses can 
cause cancer. Especially significant are retroviruses, a type of 
RNA virus that is capable of using reverse transcriptase to 
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TABLE 11-2 Comparison of Key Features of Tumor Suppressor Genes and Oncogenes 

FEATURE 

TUMOR SUPPRESSOR GENES 

ONCOGENES 

Function of normal version 

Regulates cell growth and proliferation: 
some can induce apoptosis 

Promotes cell growth and proliferation 

Mutation (at cell level) 

Recessive (both copies of gene inactivated) 

Dominant (only one copy of gene mutated) 

Effect of mutation 

Loss of function 

Gain of function 

Germline mutations resulting in 
inherited cancer syndromes 

Seen in most tumor suppressor genes 

Seen in only a few oncogenes 


TABLE 11- 

3 Examples of Oncogenes and Their Roles in Cancer 

ONCOGENE 

FUNCTION 

ASSOCIATED TUMOR 

Growth Factor 

Genes 


HST 

Fibroblast growth factor 

Stomach carcinoma 

SIS 

P subunit of platelet-derived growth factor 

Glioma (brain tumor) 

KS3 

Fibroblast growth factor 

Kaposi sarcoma 

Growth Factor 

Receptor Genes 


RET 

Receptor tyrosine kinase 

Multiple endocrine neoplasia: thyroid carcinoma 

ERBB 

Epidermal growth factor receptor 

Glioblastoma (brain tumor): breast carcinoma 

ERBA 

Thyroid hormone receptor 

Acute promyelocytic leukemia 

NEU (ERBB2) 

Receptor protein kinase 

Neuroblastoma: breast carcinoma 

MET 

Receptor tyrosine kinase 

Hereditary papillary renal carcinoma: hepatocellular carcinoma 

KIT 

Receptor tyrosine kinase 

Gastrointestinal stromal tumor syndrome 

Signal Transduction Genes 


HRAS 

GTPase 

Carcinoma of colon, lung, pancreas 

KRAS 

GTPase 

Melanoma, thyroid carcinoma, acute monocytic leukemia, 
colorectal carcinoma 

NRAS 

GTPase 

Melanoma 

BRAE 

Serine/threonine kinase 

Malignant melanoma: colon cancer 

ABL 

Protein kinase 

Chronic myelogenous leukemia: acute lymphocytic leukemia 

CDK4' 

Cyclin-dependent kinase 

Malignant melanoma 

Transcription Factor Genes 


NMYC 

DNA-binding protein 

Neuroblastoma; lung carcinoma 

MYB 

DNA-binding protein 

Malignant melanoma; lymphoma; leukemia 

EOS 

Interacts with JUN oncogene to regulate 
transcription 

Osteosarcoma 


*For additional examples, see Croce CM. Oncogenes and cancer. N Engl J Med 2008:358(5):502-511 and Garraway and Lander. Lessons 
from the cancer genome. Ce//2013:153:17-37. 

^CDK4, KIT, MET, and RET are proto-oncogenes in which germline mutations can give rise to inherited cancer syndromes. 


transcribe its RNA into DNA. In this way, the RNA genome 
of the retrovirus is converted to DNA, which can be inserted 
into a chromosome of a host cell. Some retroviruses carry 
altered versions of growth-promoting genes into cells. These 
growth-promoting genes are oncogenes, which were first 
identified through the study of retroviruses that cause cancer 
in chickens. When the retrovirus invades a new cell, it can 
transfer the oncogene into the genome of the new host, thus 
transforming the cell and initiating cancer. 

A number of gene products that affect cell growth or dif¬ 
ferentiation have been identified through the study of onco¬ 
genes carried by transforming retroviruses. For example, 
retrovirus studies identified the gene encoding the receptor 
molecule for epidermal growth factor (EGF), through the 


ERBB oncogene. These studies also identified the RAS {rat 
sarcoma) oncogenes, which are altered in at least 25% of 
human cancers. Transforming retroviruses have also identi¬ 
fied the nuclear transcription factor genes, MYC, fUN, and 
FOS, as other molecular components capable of initiating cell 
transformation. Table 11-3 provides some examples of 
proto-oncogenes. 

Oncogenes have also been identified in experiments in 
which material from human tumor cells was transferred to 
nontumor cells (transfection), causing transformation of the 
recipients. A classic experiment began with the transfer of 
DNA from a human bladder-cancer cell line into mouse cells. 
A few recipient cells became fully transformed. Cloning and 
examination of the human-specific DNA sequences present 
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in the transformed mouse cells revealed that the transform¬ 
ing gene was a mutant allele of the same RAS oncogene previ¬ 
ously identified by retroviral studies. Thus, the same oncogene 
that could be transferred by retroviruses also occurs naturally, 
as a proto-oncogene, in the human genome. 

Characterization of the protein product of mutant forms 
of RAS has revealed an important mechanism for the regula¬ 
tion of signal transduction. The RAS protein normally cycles 
between an active form bound to guanosine triphosphate 
(GTP) and an inactive form bound to guanosine diphos¬ 
phate (GDP). The biochemical consequence of RAS muta¬ 
tions is a RAS protein that is unable to shift from the active 
GTP form, which stimulates growth, to the inactive GDP 
form. The mutant RAS protein cannot extinguish its growth 
signal, contributing to excessive cell division. 

A third approach for identifying oncogenes derives from 
the common observation of chromosomal rearrangements, 
such as translocations, in some types of tumor cells (see 
Chapter 6). A well-known example is the Philadelphia chro¬ 
mosome, in which a translocation between chromosomes 9 
and 22 places the ABL proto-oncogene next to the BCR gene, 
which enhances tyrosine kinase activity and produces chronic 
myelogenous leukemia. Another translocation, t(15;17) 
(q22;ql 1.2-12), is seen in acute promyelocytic leukemia 
(APT) and fuses two genes together: the retinoic acid receptor 
alpha (RARA) gene on chromosome 17 and the promyelocytic 
leukemia (PML) gene on chromosome 15. The fusion product 
(PML-RARa) interferes with the ability of the normal RARa 
protein to induce terminal differentiation of myeloid cells. 
(Interestingly, retinoic acid was already in use as a therapeutic 
agent for APT.) The fusion product also impairs the function 
of the PML protein, which acts as a tumor suppressor by 
helping to initiate apoptosis in damaged cells. 

Retroviruses are capable of inserting onco¬ 
genes into the DNA of a host cell, thus trans¬ 
forming the host into a tumor-producing cell. 
The study of such retroviral transmission has 
identified a number of specific oncogenes. The 
transfection of oncogenes from tumor cells to 
normal cells can cause transformation of the 
normal cells. Some oncogenes were identified 
when specific rearrangements of chromo¬ 
somal material were found to be associated 
with certain cancers. Because these transloca¬ 
tions alter genes vital to cellular growth control, 
the sites of such rearrangements can be inves¬ 
tigated to identify new oncogenes. 

The identification of oncogenes has greatly increased our 
understanding of some of the underlying causes of cancer. In 
addition, oncogenes provide important targets for cancer 
therapy because of their key role in carcinogenesis. For 
example, the ERBB2 oncogene, mentioned earlier and also 
known as HER2/NEU, is amplified in approximately 20% to 
30% of invasive breast carcinomas. Its amplification in breast 
tumor cells, which can be identified by fluorescent in situ 


hybridization (FISH) or array comparative genomic hybrid¬ 
ization (CGH) (see Chapter 6), is associated with aggressive 
cancer. The protein product of EIER2/NEU is a growth factor 
receptor located on the surfaces of breast cancer cells. Iden¬ 
tification of the oncogene and its product contributed to the 
development of drugs, such as trastuzumab, that bind to the 
amplified gene product, effectively down-regulating it and 
helping to treat this form of breast cancer. Similar drugs have 
been developed to counter the effects of the upregulated ABL 
oncogene in chronic myelogenous leukemia, an upregulated 
epidermal growth factor receptor gene in non-small-cell lung 
cancer, and several others. 

DNA Repair Genes, Chromosome Integrity, 
and Tumorigenesis 

Tumor cells typically are characterized by widespread muta¬ 
tions, chromosome breaks, and aneuploidy. This condition, 
termed genomic instability, contributes to tumorigenesis 
because mutations and chromosome defects can activate 
oncogenes or deactivate tumor suppressor genes. Genomic 
instability can occur because of defects in the proteins 
required for accurate cell division or in proteins responsible 
for DNA repair. It is also associated with hypomethylation of 
DNA, a common feature of many tumors. These defects are 
in turn the result of mutations. Sometimes, these mutations 
are inherited, resulting in relatively rare inherited cancer syn¬ 
dromes (see Table 11-1). More often, they arise in somatic 
cells and contribute to common, noninherited cancers. 

There are a number of ways various types of genomic 
instability can give rise to cancer. Some breast cancers are 
caused by defective repair of double-stranded breaks that 
occur in DNA (e.g., from radiation exposure). This can result 
from mutations in genes such as BRCAl, BRCA2, or ATM. 
An inherited form of colon cancer, discussed later, can result 
from faulty DNA mismatch repair (so named because single¬ 
base mutations can lead to a DNA molecule in which base 
pairs are not complementary to each other: a mismatch). 
Xeroderma pigmentosum, an inherited condition that is 
characterized in part by multiple skin tumors (see Chapter 
3), is the result of impaired nucleotide excision repair. Defects 
in proteins responsible for chromosome separation during 
mitosis (e.g., spindle fibers) can give rise to the multiple 
aneuploidies typically seen in tumor cells. Aneuploidy can 
contribute to tumorigenesis by creating extra copies of onco¬ 
genes or by deleting tumor suppressor genes. 

Genomic instability, which can result from 
defects in DNA repair, is often observed in 
tumor cells and is characterized by widespread 
mutations, chromosome breaks, and aneu¬ 
ploidy. These alterations can cause cancer 
when they affect pathways that regulate cel¬ 
lular proliferation. 

Genetic Alterations and Cancer Cell Immortality 

Even after a tumor cell has escaped regulation by tumor sup¬ 
pressors or DNA repair proteins, it must overcome one more 
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hurdle to unlimited proliferation: the intrinsic limitation on 
the number of cell divisions allowed to each cell. Ordinarily, 
a cell is restricted to about 50 to 70 mitotic divisions. After 
reaching this number, the cell typically becomes senescent 
and cannot continue to divide. Research has provided new 
insights on the mechanisms that count the number of cell 
divisions and has illustrated the ways tumor cells can circum¬ 
vent the counting system. 

Each time a cell divides, the telomeres of chromosomes 
shorten slightly because DNA polymerase cannot replicate 
the tips of chromosomes. Once the telomere is reduced to a 
critical length, a signal is transmitted that causes the cell to 
become senescent. This process would place severe limita¬ 
tions on the proliferating cells in a tumor, preventing further 
clonal expansion. Many tumor cells overcome the process by 
activating a gene that encodes telomerase, a reverse tran¬ 
scriptase that replaces the telomeric segments that are nor¬ 
mally lost during cell division. Activation of this enzyme, 
which is rarely present in normal cells but is found in 85% 
to 90% of tumor cells, is part of a process that allows a 
tumor cell to continue to divide without the restraint ordi¬ 
narily imposed by telomere shortening. This uninhibited 
division allows the tumor to become large, and, by allowing 
continued DNA replication, it permits the accumulation 
of additional mutations that can further contribute to the 
aggressiveness of the tumor cell. Recent whole-genome 
sequencing studies have shown that somatic mutations in 
the promoter region of the TERT gene (which encodes a 
component of telomerase) are seen in 70% of melanomas. 
These mutations increase the expression of TERT and are 
likely to contribute to increased telomerase activity in mela¬ 
noma cells. 

Normally, progressive shortening of telo¬ 
meres limits the number of divisions of a cell 
to about 50 to 70. Tumor cells overcome this 
limitation by activating telomerase, which 
replaces the telomere segments that are lost 
during each cell division. This appears to help 
tumor cells escape the restraint of cellular 
senescence. 

IDENTIFICATION OF INHERITED 
CANCER-CAUSING GENES 

Although the methods described in the previous section 
have been successful in identifying many oncogenes, they are 
not well suited to the identification of tumor suppressor 
genes. These methods require dominant expression of the 
mutant phenotype, which is characteristic of oncogenes, 
whereas mutant tumor suppressor alleles have a primarily 
recessive phenotype at the level of the cell. Alternative 
approaches to the identification of these tumor suppressor 
genes were necessary. In families with hereditary cancer, 
linkage analysis (see Chapter 8) using polymorphic markers 
can identify the chromosomal segment bearing a cancer- 
causing mutation. This approach has been used to identify 


mutations that cause inherited forms of breast and colon 
cancer (discussed later). 

A second approach takes advantage of the frequent chro¬ 
mosomal losses associated with tumor suppressor genes. As 
already described, inherited tumor suppressor mutations 
typically result in persons who are heterozygous for the muta¬ 
tion in all of their cells (a first hit). However, the tumor sup¬ 
pressor mutation is a recessive allele at the cellular level: there 
are two hits, resulting in the loss of both normal copies of the 
tumor suppressor gene. Often an inherited mutant allele is 
unmasked in tumor cells by a deletion of part of the other 
copy of the homologous chromosome that carried the normal 
allele. Therefore, the observation that a specific chromosome 
segment is deleted in tumor cells suggests a map location for 
the inherited mutation. 

The deleted chromosome regions in tumors can be pin¬ 
pointed by examining a series of closely linked marker poly¬ 
morphisms, such as STRs, in the region and determining 
which of the markers that are heterozygous in the patient’s 
constitutional DNA have become homozygous in tumor 
DNA (i.e., which markers have lost an allele in the process of 
tumorigenesis). This loss of heterozygosity in tumor DNA 
indicates that the normal tumor suppressor gene, as well as 
polymorphic markers surrounding it, has been lost, leaving 
only the abnormal copy of the tumor suppressor gene (see 
Figs. 11-4 and 11-6). This approach was used, for example, to 
help narrow the locations of the retinoblastoma gene on the 
long arm of chromosome 13 and of a gene for Wilms tumor 
(nephroblastoma) on lip. Deleted regions in tumor DNA 
can also be identified by array CGH (see Chapter 6) or 
whole-genome sequencing (see Chapter 3). After a loss of 
heterozygosity is identified in a chromosome region, the 
DNA sequences within this region can be tested to identify 
the tumor-causing mutation(s). 

A third approach for identifying inherited cancer-causing 
variants is through genome-wide association studies (CWAS). 
As discussed in Chapter 8, a CWAS is carried out by compar¬ 
ing the frequencies of thousands to millions of SNPs, typi¬ 
cally assayed on a microarray, in a large series of cases and 
matched disease-free controls. If a SNP shows a significantly 
different frequency in the two groups, it is likely to be within 
or near a gene that contributes to cancer susceptibility. This 
approach has been especially effective in identifying addi¬ 
tional risk loci for common cancers such as those of the 
breast, colon, and prostate. Typically, these SNPs individually 
raise (or lower) the risk of cancer only modestly, by 5% to 
20%. However, when a cancer risk profile is based on multiple 
cancer-associated SNPs, significant information about an 
individual’s cancer risk can be derived. 

As exome and whole-genome sequencing has become 
more affordable (see Chapter 8), this technology has been 
used increasingly to carry out case-control and family-based 
studies to identify cancer-causing genes. Because sequencing 
identifies all variants in the exome or genome, it is much 
more likely to pinpoint a specific cancer-causing variant than 
is a microarray-based study. Many of these variants are rare 
in populations, so large samples of cases and controls are 
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FIG 11-6 A, A and B represent two microsatellite (STR) polymorphisms that have been assayed 
using DNA from a cancer patient's normal cells (Nj and tumor cells (T). In normal cells, the patient 
is heterozygous for both marker loci. Deletion of the long arm from one of the paired chromo¬ 
somes in tumor cells results In loss of heterozygosity (LOH) for locus B (I.e., the band corre¬ 
sponding to allele 67 is missing, with only a faint signal due to residual traces of normal cells 
in the tumor specimen). LOH is a signpost for a tumor-suppressor gene near the deleted locus. 
B, Loss of heterozygosity in breast tumor cells revealed by a SNP microarray. Each dot represents 
the genotype of one SNP typed in a collection of one person's tumor cells. Each SNP has three 
possible genotypes: AA, AB, or BB. A region of more than 20 Mb on chromosome 2 has no 
heterozygous SNPs (genotype AB), Illustrating a loss of heterozygosity in this region (and thus 
the presence of a likely tumor suppressor gene, with only the mutated copy expressed in these 
cells). (A, Courtesy Dr. Dan Fults, University of Utah Health Sciences Center.) 


needed to provide adequate evidence that the variant is likely 
to cause cancer. Families with multiple affected members can 
prove useful when analyzing rare variants because the prin¬ 
ciples of linkage can be used to show that a cancer-causing 
variant is transmitted along with a cancer phenotype within 
a family. 

In many sequencing studies, the DNA sequence from 
tumor samples is compared with DNA sequences from tissues 
unaffected by the tumor (e.g., white blood cells or buccal 
cells). DNA from the unaffected tissue represents the patient’s 
germline genetic constitution, while the DNA from a tumor 
is likely to have undergone non-inherited somatic mutations 
that contribute to tumorigenesis. Genes that consistently 
show elevated levels of somatic mutations in a specific 
tumor type become candidates as cancer-causing loci. Many 


thousands of tumor DNA samples have now been analyzed 
using this approach, revealing dozens of additional cancer- 
causing genes. 

The techniques outlined here (linkage analysis, loss of 
heterozygosity analysis, GWAS, and DNA sequencing) have 
identified hundreds of genes that are mutated in various 
cancers. Some of these genes contribute directly to a growth 
advantage in tumors and are considered primary causes 
of cancer. Approximately 150 such driver genes have been 
described. A much larger number of genes undergo somatic 
mutations during tumorigenesis, but these genes do not 
directly confer a growth advantage to cells. These are termed 
passenger genes. The RBI gene, which normally acts as 
a “brake” on cell division, is an example of a well-known 
driver gene. 
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Cancer-associated genes can be detected 
through linkage analysis or by showing that 
one homolog of a chromosome (or a part 
of it) is missing in DNA from a tumor. GWAS 
and exome or whole-genome sequencing are 
also used to identify cancer-causing genes. 
These approaches have identified driver genes, 
which give cells a growth advantage leading 
to cancer, as well as passenger genes, which 
are mutated in cancer but do not confer a 
growth advantage. Confirmation of the etio¬ 
logical role of a possible cancer-causing gene 
is obtained by showing the consistent pres¬ 
ence of mutations in the gene in DNA from 
affected patients. 

Neurofibromatosis Type 1 

The initial evidence for mapping the neurofibromatosis type 
1 gene (NFl) to chromosome 17 came from linkage studies 
in families. Subsequently, chromosomal translocations were 
discovered in the karyotypes of two unrelated patients with 
neurofibromatosis, each of whom had a breakpoint on chro¬ 
mosome 17q at a location indistinguishable from the map 
location of the NPl gene. These translocations were assumed 
to have caused neurofibromatosis in these persons by dis¬ 
rupting the NFl gene. Located only 50 kb apart, the break¬ 
points provided the physical clues necessary to define several 
candidate genes that were screened for mutations in patients 
with NFl (Fig. 11-7). 

The nucleotide sequence of the NFl gene provided an 
early clue to function when its predicted amino acid sequence 
was compared with amino acid sequences of known gene 
products found in computerized databases. Extended simi¬ 
larities were observed with the mammalian GTPase-activating 


protein (GAP). This was an important finding, because at 
least one function of GAP is to decrease the amount of active 
GTP-bound RAS. The RAS protein is a key component of the 
signal transduction pathway, transmitting positive growth 
signals in its active form. The NFl gene product, neurofibro- 
min, also plays a role in signal transduction by down¬ 
regulating RAS. 

With the identification of the NFl gene product as a com¬ 
ponent in signal transduction, a picture began to form of how 
the inheritance of a mutation in one NFl allele might con¬ 
tribute to the development of neurofibromas and cafe-au-lait 
spots. Reduced expression of the NFl gene permits increased 
RAS activity and allows the cell to escape from differentiation 
and continue its growth. Loss of the remaining normal allele 
in some cells (e.g., Schwann cells) further encourages 
unchecked growth. Discovery of the NFl gene has led to the 
identification of a key tumor suppressor that helps to regulate 
the fundamental process of signal transduction. 

The gene responsible for NFl was mapped by 
linkage in families and was identified through 
translocations and point mutations in patients. 
DNA sequencing of the gene predicted a 
protein product with a domain related to GAP, 
and a similar role in down-regulating the RAS 
signal transduction protein was confirmed by 
biochemical experiments. 

The rP5J Gene 

Somatic mutations in the TP53 gene are found in more than 
half of all human tumors, making this the most commonly 
altered cancer-causing gene. Somatic TP53 mutations are 
seen, for example, in approximately 70% of colorectal tumors, 
as well as 40% of breast tumors and 60% of lung tumors. 
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FIG 11-7 Localization of the NFl gene on chromosome 17q involved linkage analysis and iden¬ 
tification of two translocation breakpoints that interrupted the disease gene. Candidate genes 
were isolated from this region and tested for mutations in patients with NFl and in normal 
controls. 
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Approximately 80% to 90% of TP53 mutations are concen¬ 
trated in the portion of the gene that encodes a DNA-binding 
domain, typically preventing the p53 protein product from 
binding to DNA of other genes. 

Like RBI and NFl, TPS3 typically functions as a tumor 
suppressor gene (however, because TP53 mutations often 
have a dominant negative effect, the 2-hit model does not 
typically apply to this tumor suppressor gene). Its protein 
product, p53, increases in quantity in response to cell damage 
(e.g., double-stranded DNA breaks caused by ionizing radia¬ 
tion). Acting as a transcription factor, p53 helps to regulate 
dozens of genes that affect cell growth, proliferation, and 
survival. For example, p53 binds to the promoter of CDKNIA, 
whose protein product, p21, blocks the inactivation of pRb 
(see Fig. 11-5). This halts the cell cycle in the G1 phase, before 
DNA replication occurs in S phase. Arrest of the cell cycle 
before S phase provides time for repair of the damaged DNA. 
If the cell’s DNA is severely damaged, p53 may instead induce 
programmed cell death (apoptosis). This response is more 
likely if the pRb pathway for cell cycle arrest is not intact. 
Lacking the possibility of cell cycle arrest to repair damage, 
the p53 protein “chooses” cell death by interacting with genes 
involved in apoptotic pathways (e.g., PTEN, BAX). In this 
way, p53 prevents the proliferation of an abnormal, poten¬ 
tially carcinogenic cell. 

When TP53 is mutated, cells with damaged DNA can 
evade both repair and destruction, and continued replication 
of the damaged DNA can lead to tumor formation. For this 
to happen, other components of cell cycle control must also 
be compromised. For example, several DNA tumor viruses, 
such as the human papilloma virus that is responsible for 
most cases of cervical cancer, inactivate both pRb and p53. 
This produces cells that can neither repair their DNA nor 
undergo apoptosis in response to damage, leading in some 
cases to cancer. 

Carcinogenic substances can induce specific TP53 muta¬ 
tions. Dietary ingestion of aflatoxin Bi, which can produce 
liver cancer, is associated with a mutation that produces an 
arginine-to-serine substitution at position 249 of the p53 
protein. Exposure to benzopyrene, a powerful mutagen and 
carcinogen found in cigarette smoke, leads to alterations of 
specific TP53 base pairs in lung tumors. This demonstrates a 
direct molecular link between cigarette smoking and lung 
cancer. Thus, examination of the type of TP53 mutation seen 
in a tumor can provide clues to the identity of the causative 
carcinogenic agent. 

Although tumor-causing TP53 mutations have been 
observed mostly in somatic cells, germline mutations of TP53 
are responsible for an inherited cancer condition known as 
the Li-Fraumeni syndrome (LFS). This rare condition is 
transmitted in autosomal dominant fashion and involves 
breast and colon carcinomas, soft-tissue sarcomas, osteosar¬ 
comas, brain tumors, leukemia, and adrenocortical carcino¬ 
mas. These tumors usually develop at early ages in LFS family 
members, and multiple primary tumors are commonly seen 
in persons with LFS. The demonstration of consistent TP53 
mutations in the constitutional DNA of patients with LFS 


confirmed the causative role of this gene. The inheritance of 
a mutated TP53 gene greatly increases an individual’s suscep¬ 
tibility to subsequent cell transformation and tumor develop¬ 
ment. Among LFS family members who inherit a mutated 
TP53 gene, approximately 50% develop invasive cancer by 30 
years of age, and more than 90% develop invasive cancer by 
age 70 years. 

TP53 mutations account for only about 75% of LFS cases; 
some of the remaining cases are the result of mutations in 
another tumor suppressor gene, CEIEK2. This gene encodes 
a kinase that normally phosphorylates p53 in response to 
ionizing radiation, resulting in the accumulation and activa¬ 
tion of p53. Loss-of-function mutations in CHEK2 result in 
a lack of p53 activation, causing LFS via the p53 pathway. 

TP53 is medically important in at least two ways. First, the 
presence of TP53 mutations in tumors, particularly those of 
the breast and colon, often signals a more-aggressive cancer 
with relatively poor survival prospects. It is thus a useful 
prognostic indicator. Second, TP53 might ultimately prove 
important in tumor prevention. Laboratory experiments 
show that the insertion of a normal TP53 gene into tumor 
cells can induce tumor regression by inducing abnormal 
cancer cells to undergo apoptosis. This has led to gene therapy 
protocols (see Chapter 13) in which normal TP53 copies are 
inserted into tumors in an effort to eliminate cancerous cells. 

Somatic mutations of the TP53 gene are found 
in the majority of tumors. This gene encodes 
a protein that can induce either cell cycle 
arrest or apoptosis in response to damaged 
DNA. Inherited TP53 mutations can cause Li- 
Fraumeni syndrome. 

The Familial Adenomatous Polyposis Gene, APC 

Colorectal cancer (CRC) affects approximately 1 in 20 
Americans, and, like most common cancers, it is more likely 
to occur in persons with a positive family history. One’s risk 
of developing CRC is elevated two- to threefold if a first- 
degree relative is affected; it increases to three- to sixfold if 
two first-degree relatives are affected. Approximately 2% to 
5% of colon cancer cases are inherited as autosomal domi¬ 
nant syndromes, the two most important of which are dis¬ 
cussed here. 

Familial adenomatous polyposis (FAP) is characterized by 
the appearance of large numbers of colonic adenomas, a type 
of polyp, in the second or third decade of life. Colonic adeno¬ 
mas are now understood to be the immediate precursors 
to colorectal cancer. The multiple adenomas of the patient 
with FAP therefore present a grave risk of early malignancy. 
Because early detection and removal of adenomatous polyps 
can significantly reduce the occurrence of cancer, it is impor¬ 
tant to understand the causative gene and its role in the 
development of polyps (Clinical Commentary 11-1). 

The gene responsible for FAP was localized to the long arm 
of chromosome 5 by linkage analysis in families. Discovery 
of small, overlapping deletions in two unrelated patients pro¬ 
vided the key to isolation of the disease-causing gene, termed 
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FIG 11-8 The pathway to colon cancer. Loss of the APC gene transforms normal epithelial tissue 
lining the gut to hyperproliferating tissue. Hypomethylation of DNA (which can cause genomic 
instability and up-regulate proto-oncogenes), activation of the KRAS proto-oncogene, and loss of 
the SMAD4 gene are involved in the progression to a benign adenoma. Loss of the TP53 gene 
and other alterations are involved in the progression to malignant carcinoma and metastasis. 
Note that these alterations are present in varying frequencies in colon tumor cells. (Modified 
from Vogelstein B. Cancer genome landscapes. Science. 2013:339:1546-1558.) 


APC (adenomatous polyposis coli). Among the genes that lay 
within the 100-kb region that was deleted in both patients, 
one showed apparent mutations in other patients. This muta¬ 
tion was seen in one patient but not in his unaffected parents 
(i.e., a de novo mutation; see Chapter 4), which helped to 
confirm the identification of the APC gene. 

Like RBI and NPl, APC is a tumor suppressor gene, and 
both copies of APC must be inactivated in a cell for tumor 
progression to begin. Persons who inherit an APC mutation 
(the first “hit”) typically experience somatic loss-of-function 
mutations in hundreds of their colonic epithelial cells, giving 
rise to multiple adenomas. In some cases, loss of function of 
APC occurs because of hypermethylation of APC’s promoter 
region, which results in reduced transcription (see Chapter 
5). Hypermethylation, as well as other changes in gene regula¬ 
tion (Box 11-1) has been observed in the inactivation of a 
number of tumor suppressor and DNA repair genes, includ¬ 
ing those associated with retinoblastoma (RBI), breast/ 
ovarian cancer (BRCAl), hereditary nonpolyposis colorectal 
cancer (MLHl), malignant melanoma (CDKN2A), and von 
Hippel-Lindau disease (VHL). 

Identification of the APC gene has been important in diag¬ 
nosing and managing CRC in families with FAP (see Clinical 
Commentary 11-1). In addition, and perhaps even more 
importantly, somatic APC mutations are found in 85% of all 
sporadic, noninherited cases of colon cancer. These somatic 
APC mutations (i.e., those that disable both copies of the gene 
in a colonic cell) are among the earliest alterations that give 


rise to colon cancer. But APC mutations are not sufficient 
by themselves to complete the progression to metastatic 
disease. As shown in Figure 11-8, other genes are also altered. 
For example, gain-of-function mutations are seen in the 
KRAS gene in approximately 50% of colon tumors. This 
gene encodes a signal transduction molecule, and a gain-of- 
function mutation increases signaling and thus cellular pro¬ 
liferation. Loss-of-function mutations in the TP53 gene are 
also seen in more than 50% of colorectal tumors and usually 
occur relatively late in the pathway to cancer. Ordinarily, 
p53 would be activated in response to mutations like those 
of APC and KRAS, leading to DNA repair or apoptosis. Cells 
that lack p53 activity are free to continue along the path to 
malignancy in spite of their damaged DNA. Still another 
tumor suppressor gene, SMAD4, is also frequently mutated 
in the CRC pathway. Thus, multiple mutations are required 
in tumor suppressor genes and oncogenes to produce a 
colorectal tumor. 

Extensive studies have revealed at least three ways in which 
the APC protein acts as a tumor suppressor. Perhaps most 
importantly, it participates in the phosphorylation and deg¬ 
radation of (J-catenin, a key molecule in the Wnt signal trans¬ 
duction pathway. Among other things, this pathway is 
involved in activation of the Myc transcription factor. By 
reducing (J-catenin levels, APC dampens signals that lead to 
cellular proliferation. Examination of colorectal carcinomas 
that do not carry mutations in the APC gene revealed that a 
small proportion of them have gain-of-function mutations in 
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BOX 11-1 Epigenetics and Cancer 


Epigenetics is defined as the study of changes in gene expres¬ 
sion or phenotype caused by mechanisms other than variation 
in DNA sequences. Examples of epigenetic changes include 
methylation, histone modification, and mRNA-binding by 
microRNAs. As discussed in Chapter 2, methylation of a gene's 
promoter region, along with histone hypoacetylation, are asso¬ 
ciated with chromatin condensation, which inhibits the binding 
of transcription factors to a gene's promoter. Consequently, 
expression of the gene (i.e., transcription to mRNA) is reduced. 
Methylation and histone modification are involved in the pro¬ 
cesses of genomic imprinting (see Chapter 5) and X inactivation 
(see Chapter 6). Epigenetic alterations can cause persons with 
the same DNA sequences (i.e., identical twins) to have quite 
different disease profiles. 

Epigenetic alterations are important in cancer because they 
can alter the expression of many cancer-associated genes. 
Tumor cells typically exhibit widespread hypomethylation 
(decreased methylation), which can increase the activity of 
oncogenes. Hypomethylation increases as tumors progress 


from benign neoplasms to malignancy. In addition, the pro¬ 
moter regions of tumor-suppressor genes (e.g., RBI and 
BRCA1) are often hypermethylated, which decreases their rate 
of transcription and their ability to inhibit tumor formation. 
Hypermethylation also is seen in specific subgroups of 
microRNA genes. When these microRNA genes are methyl¬ 
ated, their mRNA targets are over-expressed, and this over¬ 
expression has been associated with tumorigenesis. 

Except through gene therapy (see Chapter 13), mutations in 
DNA sequences cannot be altered. Epigenetic modifications, 
however, can be reversed through the administration of thera¬ 
peutic drugs. For example, 5-azacytidine, a demethylating 
agent, has been used to treat leukemia and myelodysplastic 
syndrome. Another class of drugs, histone deacetylase (HDAC) 
inhibitors, counteracts the histone hypoacetylation that can 
silence the activity of tumor-suppressor genes. HDAC inhibitors 
have been used in the treatment of T-cell lymphomas. A chal¬ 
lenge in developing drugs that modulate epigenetic alterations 
is to target only the genes responsible for a specific cancer. 
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Global epigenomic alterations and cancer. Oncogenesis involves accumulated genetic alterations 
combined with the epigenetic changes: DNA methylation, histone modifications, and miRNAs. 
In cancer cells, tumor-suppressor genes become hypermethylated and with histone modifica¬ 
tions cause abnormal gene silencing. The gene silencing of tumor-suppressor genes results in 
tumor progression. Global hypomethylation leads to chromosomal instability and fragility. Addi¬ 
tionally, these modifications create abnormal mRNA and miRNA expression, which leads to 
activation of oncogenes and silencing of tumor-suppressor genes. (From McCance KL, Huether 
SE, Brashers VL, Rote NS. Pathophysiology: The Biologic Basis for Disease in Adults and Chil¬ 
dren. 7th ed. St Louis: Elsevier; 2014. Adapted from Sandoval J, Esteller M. Cancer epigenomics: 
beyond genomics. Curr Opin Genet Dev 2012:22:50-55.) 
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CLINICAL COMMENTARY 11-1 
The APC Gene and Colorectal Cancer 

Colorectal cancer (CRC) will be diagnosed in about 1 in 20 
Americans. Currently, the mortality rate for this cancer is 
approximately one third. Genetic and environmental factors, 
such as dietary fat and fiber, are known to influence the prob¬ 
ability of occurrence of colorectal cancer. 

As indicated in the text, familial adenomatous polyposis (FAP) 
is an autosomal dominant subtype of colon cancer that is char¬ 
acterized by a large number of adenomatous polyps (Fig. 11-9). 
These polyps typically develop during the second decade of life 
and number in the hundreds or more (polyposis is defined as 
the presence of >100 polyps). The average age of CRC diag¬ 
nosis in untreated FAP patients is 39 years; 95% of untreated 
FAP patients will develop CRC by age 50. Germline mutations 
in the APC gene are consistently identified in family members 
affected with FAP, and approximately 25% of cases are the 
result of new mutations in APC. More than 2000 different 
mutations of the APC gene have been reported, most of which 
are nonsense or frameshift mutations. Because these muta¬ 
tions produce a truncated protein product, a protein trunca¬ 
tion test can be used to help determine whether the at-risk 
person has inherited an APC mutation. (As discussed in Chapter 
3, this test involves the in vitro manufacture and analysis of a 
protein product from the gene of interest.) It is now more 
common to rely on DNA-based tests, including direct sequenc¬ 
ing, to identify APC mutations. These diagnostic tests, which 
identify mutations in 80% to 90% of classic FAP cases, are 
important for family members because a positive result alerts 



FIG 11-9 A portion of a colon removed from a patient with 
familial adenomatous polyposis, illustrating a large number 
of adenomatous polyps covering the colon. Each of these 
benign neoplasms has the potential to become a malignant 
tumor. 


them to the need for frequent surveillance and possible 
colectomy. 

FAP is relatively rare, affecting only about 1 in 10,000 persons 
and accounting for less than 1 % of all cases of colon cancer. 
The broader significance of the APC gene derives from the fact 
that somatic mutations in this gene are seen in approximately 
85% of all colon cancers. Furthermore, APC mutations typically 
occur early in the development of colorectal malignancies. A 
better understanding of the APC gene product, how it interacts 
with other proteins, and how it interacts with environmental 
factors such as diet, has provided important clues for the pre¬ 
vention and treatment of common CRC. In this way, the iden¬ 
tification of a gene responsible for a relatively rare cancer 
syndrome can have widespread clinical implications. 

Treatment for FAP usually involves surgical resection of the 
colon and chemotherapy. However, because CRC is usually 
preceded by the appearance of benign polyps, it is one of the 
most preventable of cancers. The National Polyp Study Work¬ 
group estimates that colonoscopic removal of polyps could 
reduce the nationwide incidence of CRC by as much as 90%. 
The importance of early intervention and treatment further 
underscores the need to understand early events in colorectal 
cancer, such as somatic mutations in the APC gene. 

Because polyps usually begin to appear in the second decade 
of life in those who inherit an APC mutation, an annual colo¬ 
noscopy is recommended in these persons starting at 10-12 
years of age. Upper gastrointestinal polyps are seen in 90% of 
FAP patients by age 70. Thus, endoscopy of the upper gastro¬ 
intestinal tract is recommended every 1 to 2 years, starting at 
ages 20 to 25 years. Because classic FAP results in hundreds 
to thousands of polyps, colectomy is often necessary by age 
20 years. There is evidence that the use of nonsteroidal anti¬ 
inflammatory drugs causes some degree of polyp regression. 
Persons with FAP have increased risks of other cancers, includ¬ 
ing gastric cancer (<1 % lifetime risk), duodenal adenocarci¬ 
noma (5-10% lifetime risk), hepatoblastoma (1% risk), and 
thyroid cancer. 

Mutations in the APC gene can also produce a related syn¬ 
drome, termed attenuated familial adenomatous polyposis. 
This syndrome differs from FAP in that patients have fewer 
than 100 polyps (typically 10-20). Most of the mutations that 
produce this form of FAP are located in the 5' or 3' regions of 
APC. 

FAP can also result from recessive mutations in MUTYH, a 
gene that encodes a DNA repair protein. These mutations are 
estimated to account for about 30% of cases of attenuated 
FAP and approximately 10% to 20% of classic FAP cases in 
which an APC mutation is not detected. 


the (3-catenin gene, thus confirming the potential etiological 
role of this gene in CRC. APC mutations are also thought to 
affect cell-to-cell and cell-to-matrix adhesion properties (this 
is important because alteration of cell adhesion control 
permits cells to invade other tissues and to metastasize to 
other sites). Again, this activity is mediated through (J-catenin, 
which interacts with a cell-surface molecule (E-cadherin) 


whose loss of function leads to abnormal cell adhesion prop¬ 
erties. Finally, APC is expressed in the microtubules that pull 
chromosomes apart during meiosis (see Chapter 2). Altera¬ 
tions in APC result in altered microtubule activity, such that 
aneuploidies and chromosome breaks arise during mitosis. 
Thus, APC mutations also promote cancer by increasing 
genomic instability. 
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The adenomatous polyposis coli gene (APC), 
which strikingly predisposes to colon cancer, 
was ultimately identified from mutations in 
patients. APC is also involved in the great 
majority of sporadic cases of colon cancer and 
is in fact one of the earliest alterations leading 
to colon tumorigenesis. This tumor suppressor 
gene has been shown to function as a major 
regulator of the Wnt signal transduction 
pathway via its interaction with p-catenin. It is 
also involved in cell adhesion control and in 
maintaining chromosome stability during 
mitosis. 

The Hereditary Nonpolyposis Colon 
Cancer Genes 

Hereditary nonpolyposis colon cancer (HNPCC, or Lynch 
syndrome), a second form of inherited colon cancer, accounts 
for up to 5% of all colorectal cancer cases. Like LAP, HNPCC 
is an autosomal dominant, high-penetrance cancer syn¬ 
drome, with a lifetime colorectal cancer risk of 50% to 80% 
in heterozygotes. In addition, the risk of endometrial cancer 
among females with HNPCC is approximately 40% to 60%, 
and the risk of ovarian cancer is 5% to 10%. Cancers of the 
small bowel, stomach, brain, pancreas, renal pelvis, and ureter 
are seen in a smaller percentage of mutation carriers. In con¬ 
trast to FAP, HNPCC patients do not have polyposis; they 
typically have a relatively small number of polyps. Also, 
polyps in HNPCC patients are more likely to occur in the 
proximal colon, whereas those of FAP patients are more likely 
to be concentrated in the distal colon. 

Approximately 40% to 60% of HNPCC cases are caused 
by mutations in a gene called MSH2, and another 25% to 
30% of cases are caused by mutations in the MLHl gene. 
Mutations in three other genes, PMS2, MSH6, and EPCAM, 
help to account for a small percentage of additional cases. 
Each of these genes is known to play an important role in 
DNA mismatch repair in many different organisms (in fact, 
a critical clue in their identification was the existence of 
highly similar DNA repair genes in yeast and bacteria). Inac¬ 
tivation of both alleles of any one of these genes increases the 
genome-wide mutation rate in an affected cell by as much as 
1000-fold. This increased rate of mutation results in the alter¬ 
ation of a number of cellular regulatory genes, thus leading 
to an increased incidence of cancer. A characteristic feature 
of tumors from HNPCC patients is a high degree of instabil¬ 
ity of microsatellite loci (see Chapter 3), which generates 
many new microsatellite alleles. Such microsatellite instability 
is also seen in about 15% of sporadic colorectal carcinomas, 
but somatic loss-of-function mutations in the HNPCC genes 
seem to occur only infrequently in these tumors. Instead, 
the most common alteration seen in these sporadic tumors 
is hypermethylation of the MLHl gene, resulting in its 
inactivation. 

A comparison of FAP and HNPCC reveals interesting 
differences in the way each syndrome leads to colon cancer. 
In FAP, an inherited APC mutation results in hundreds of 


polyps, each of which has a relatively low probability of 
incurring all of the other genetic alterations required for 
progression to metastatic cancer. But, because the number 
of polyps is large, there is a high probability (almost 100%) 
that at least one of them will produce a cancerous tumor. 
In HNPCC, the number of polyps is much smaller (hence 
the term nonpolyposis), but, because of a relative lack of 
DNA repair, each polyp has a high probability of experienc¬ 
ing the multiple alterations necessary for tumor develop¬ 
ment. Consequently, the average age of onset of colorectal 
cancer in HNPCC is only about 10 years later than that 
of FAP. 

HNPCC (Lynch syndrome) is an inherited form 
of colorectal cancer that is caused by muta¬ 
tions in any of five genes involved in DNA 
mismatch repair. It represents an example of 
a cancer syndrome associated with microsatel¬ 
lite instability. 

Inherited Breast Cancer 

The lifetime prevalence of breast cancer in women is 1 in 8, 
and a woman s risk of developing breast cancer doubles if a 
first-degree relative is affected. Two genes, BRCAl and BRCA2, 
have been identified as major contributors to inherited breast 
cancer. This section addresses three critical questions regard¬ 
ing these genes: What percentage of breast cancer cases are 
the result of BRCAl and BRCA2 mutations? Among those 
who inherit a mutation, what is the risk of developing cancer? 
How do mutations in these genes contribute to cancer 
susceptibility? 

Population-based studies show that only a small percent¬ 
age of all breast cancers—approximately 1% to 3%—can be 
attributed to inherited mutations in BRCAl or BRCA2. 
Among women with breast cancer who also have a positive 
family history of the disease, the percentage with inherited 
mutations in either of these genes increases to approximately 
20% to 30%. Among affected women who have a positive 
family history of both breast and ovarian cancer, 60% to 80% 
have inherited a BRCAl or BRCA2 mutation. Inherited muta¬ 
tions in these genes are also more common among women 
with early-onset breast cancer and among those with bilateral 
breast cancer. 

Women who inherit a mutation in BRCAl experience a 
50% to 80% lifetime risk of developing breast cancer; the 
lifetime risk for those who inherit a BRCA2 mutation is 
slightly lower, averaging approximately 50%. BRCAl muta¬ 
tions also increase the risk of ovarian cancer among women 
(40-50% lifetime risk, which is substantially higher than the 
lifetime risk of 1/70 in the general female population). These 
mutations also confer a modestly increased risk of prostate 
and colon cancers. BRCA2 mutations confer an increased risk 
of ovarian cancer (20% lifetime risk). Approximately 6% of 
males who inherit a BRCA2 mutation develop breast cancer, 
which represents a 70-fold increase over the risk in the general 
male population (see Chapter 12 for further discussion of 
breast cancer risk factors). 
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BRCAl and BRCA2 were both initially identified by linkage 
analysis in families, followed by positional cloning. Most 
BRCAl and BRCA2 mutations result in truncated mRNA or 
protein products and a consequent loss of function. As is the 
case for the RBI and ARC genes, affected persons inherit one 
copy of a BRCAl or BRCA2 mutation and then experience a 
somatic loss of the remaining normal allele in one or more 
cells (following the two-hit model for tumor suppressor 
genes). In contrast to RBI and ARC, somatic mutations 
affecting these genes are seldom seen in sporadic (noninher- 
ited) breast tumors. BRCAl and BRCA2 are both large genes, 
and they exhibit extensive allelic heterogeneity (approxi¬ 
mately 2000 distinct mutations, most of which are deletions 
or nonsense or frameshift mutations, have been reported 
for each gene). This poses challenges for genetic diagnosis 
(see Chapter 13), which is done primarily by direct DNA 
sequencing of the coding and regulatory regions of both 
genes. Typically, a panel of approximately one dozen breast 
cancer-associated risk genes are now tested for disease- 
causing mutations (see Chapter 13). 

Although BRCAl and BRCA2 share no significant DNA 
sequence similarity, they both participate in the DNA repair 
process. The protein product of BRCAl is phosphorylated 
(and thus activated) by the ATM and CHEK2 kinases in 
response to DNA damage (Fig. 11-10). The BRCAl protein 
product binds to the BRCA2 product, which in turn binds to 
RAD51, a protein involved in the repair of double-stranded 
DNA breaks (as with the HNRCC genes, yeast and bacteria 
both have DNA-repair genes similar to RAD51). BRCAl and 
BRCA2 thus participate in an important DNA repair pathway, 
and their inactivation results in incorrect DNA repair and 
genomic instability. In addition to their roles in the RAD51 
pathway, BRCAl and BRCA2 help to suppress tumor forma¬ 
tion through their interactions with previously discussed 
proteins, such as p53, pRb, and Myc, and by helping to main¬ 
tain genomic stability. 

Because all of the genes illustrated in Figure 11-10 are 
involved in a DNA repair pathway, it might be anticipated 
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FIG 11-10 The roles of BRCAl and BRCA2 in DNA repair. 
BRCAl is phosphorylated by ATM and CHEK2 in response to 
double-stranded DNA breaks (produced, for example, by ion¬ 
izing radiation). BRCAl binds to BRCA2, which interacts with 
RAD51 to form a complex involved in DNA repair. 


that mutations in genes other than BRCAl or BRCA2 could 
lead to DNA repair defects and possibly cancer. This is indeed 
the case. As already discussed, mutations in CHEK2 can 
cause FFS. Mutations in the ATM gene can cause ataxia tel¬ 
angiectasia (see Chapter 3), an autosomal recessive disease 
that involves extensive genomic instability, cerebellar ataxia, 
dilated vessels in the eyes and skin (telangiectasia), and 
cancers of mainly lymphoid origin. Another autosomal reces¬ 
sive chromosome instability syndrome, Fanconi anemia, can 
be caused by the inheritance of two copies of a BRCA2 
mutation. 

Although BRCAl and BRCA2 mutations are the most 
common known causes of familial breast cancer, this disease 
can also be caused by inherited mutations in several other 
tumor suppressor genes (e.g., the previously discussed CHEK2 
and TR53 genes). Germline mutations in a tumor suppressor 
gene called RTEN are responsible for Cowden disease, which 
is characterized by multiple benign tumors and an increased 
susceptibility to breast cancer. The risk of breast cancer 
among heterozygous carriers of mutations in the ATM gene 
is approximately double that of the general population. 
Mutations in the RALB2 gene, which forms a complex with 
BRCAl and BRCA2 in repairing double-stranded DNA 
breaks, also double the risk of breast cancer. 

It is estimated that the major breast cancer genes, such as 
BRCAl, BRCA2, RTEN, RALB2, and CHEK2, account for less 
than 25% of the overall inherited predisposition to breast 
cancer. Other breast cancer-causing genes are likely to exist, 
but their individual effects on cancer risk are thought to be 
relatively small. Farge-scale genome-wide association and 
DNA sequencing studies (see Chapter 8) have now identified 
multiple additional inherited genetic variants that contribute 
small increases in breast cancer risk (Fig. 11-11). For example, 
a variant in the gene that encodes fibroblast growth factor 
receptor 2 (FGFR2) increases breast cancer risk by about 25%. 

Most studies indicate that the clinical course of breast 
cancer among patients with BRCAl or BRCA2 mutations is 
not substantially different from that of other breast cancer 
patients. Flowever, the substantial increase in risk of ovarian 
cancer (which has a high mortality rate and is difficult to 
detect early), has led to the recommendation that women 
who have a causal BRCAl or BRCA2 mutation should 
undergo a prophylactic oophorectomy after they have com¬ 
pleted their child-bearing years. This reduces ovarian cancer 
risk by about 90%, and because it reduces estrogen levels, it 
reduces breast cancer risk by about 50%. Bilateral prophylac¬ 
tic mastectomy, an option chosen by some BRCAl and 
BRCA2 mutation carriers, reduces the risk of breast cancer by 
approximately 90%. 

Mutations in BRCAl and BRCA2 are responsi¬ 
ble for a significant fraction of inherited breast 
cancer cases, especially those of early onset. 
These mutations usually result in a truncated 
protein product and loss of function. The 
protein products of these genes play important 
roles in DNA repair. 
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FIG 11-11 The relationship between the relative risk of breast cancer due to a cancer-associated 
variant (y-axis) and the frequency of the variant in the population (x-axis). Inherited disease- 
causing variants in BRCAl and BRCA2 are collectively rare in the population, but they confer a 
large increase in risk among those who have the variants. Risk alleles in ATM and CHEK2 have 
higher frequencies, but they confer only a two- to threefold increase in risk. Dozens of common 
SNP alleles are associated with an increase in breast cancer risk, but they each confer an 
additional risk of only 5% to 20%. (Courtesy Sean Tavtigian, PhD, Huntsman Cancer Institute, 
University of Utah Health Sciences Center.) 


Familial Melanoma 

Largely as a result of increased exposure to ultraviolet radia¬ 
tion, the incidence of melanoma has increased approximately 
20-fold in the United States during the past 70 years. It is now 
one of the most common cancers, with 76,000 new cases per 
year. The risk of developing melanoma increases by a factor 
of 2 when a first-degree relative is affected. The risk increases 
further, to approximately sixfold, when the first-degree rela¬ 
tive is affected before 50 years of age. It is estimated that 
approximately 10% of melanoma cases occur in inherited, 
familial forms. 

Germline mutations in the CDKN2A gene are estimated 
to be involved in 20% to 40% of familial melanoma cases. 
CDKN2A encodes two different proteins, both of which are 
important components of the cell cycle. The first protein, 
pi6, is a cyclin-dependent kinase inhibitor which interacts 
negatively with a cyclin-dependent kinase (CDK4) that 
phosphorylates and down-regulates the pRb protein (see Fig. 
11-5). Because pRb acts as a brake on the cell cycle, its down- 
regulation by CDK4 promotes progression through the cell 
cycle and can lead to cell proliferation. By negatively regulat¬ 
ing CDK4, pl6 acts as an upstream brake on the cell cycle. 
When pl6 activity is lost through loss-of-function muta¬ 


tions in CDKN2A, CDK4 activity is increased and cell prolif¬ 
eration can ensue. This can lead to melanomas. The second 
protein product encoded by CDKN2A is produced by alter¬ 
native splicing of the gene’s transcript. This protein, termed 
pl4, binds to and inhibits MDM2. The MDM2 protein, as 
shown in Figure 11-5, binds to and degrades p53. Thus, by 
inhibiting MDM2, pl4 increases p53 expression. As dis¬ 
cussed earlier, p53 expression is needed to arrest the cell 
cycle for DNA repair in response to cell damage, and it can 
cause damaged cells to undergo apoptosis. A loss of pl4 
activity leads to a loss of p53 activity, and this can also 
produce melanomas. 

Inherited mutations in the gene that encodes CDK4 can 
also result in familial melanoma. These rare gain-of-function 
mutations convert the cyclin-dependent kinase from a proto¬ 
oncogene to an activated oncogene. The activated CDK4 
down-regulates pRb, resulting again in a lack of cell cycle 
control and tumor formation. Melanoma provides an example 
in which the same tumor type can result from either the 
activation of a proto-oncogene (CDK4) or the loss of a tumor 
suppressor gene (CDKN2A). 

CDKN2A plays a role not only in familial melanoma but 
also in most sporadic melanomas, in which somatic loss-of- 
function mutations of this gene lead to inactivation of the 
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pl6 tumor suppressor protein. About 50% of sporadic mela¬ 
nomas contain somatic deletions of CDKN2A, and loss-of- 
function point mutations are seen in another 5% to 10% of 
melanomas. Hypermethylation of the promoter region of 
CDKN2A, which down-regulates the gene, is seen in % to % 
of all melanomas. As might be expected, somatic mutations 
in other genes are also seen in sporadic melanomas. Approxi¬ 
mately half of these tumors contain somatic gain-of-function 
mutations in BRAF, a gene that encodes a kinase involved in 
the RAS signal-transduction pathway (drugs that inhibit 
BRAF are now being used to treat melanoma). In addition, 
one of the RAS genes, NRAS, is mutated in 15% to 30% of 
sporadic melanomas. Approximately 10% of melanomas 
have somatic TP53 mutations, and about 6% have somatic 
RBI mutations (persons who inherit an RBI mutation also 
have an increased risk of melanoma). 

Genome-wide association studies using microarrays, as 
well as exome and whole-genome sequencing, have identi¬ 
fied a number of additional genes that influence the risk of 
melanoma. Many of these genes are involved in producing 
eumelanin, a skin pigment that helps to protect skin cells 
from harmful ultraviolet radiation. An example is the MCIR 
gene, in which inherited loss-of-function variants are associ¬ 
ated with low levels of eumelanin and high risk of sunburn 
(red hair and freckling are also associated with these vari¬ 
ants). Inheritance of one of these variants increases the risk 
of melanoma by approximately 50%, and inheritance of 
multiple MCIR variants can more than double the risk of 
melanoma. 

Familial melanoma can be caused by loss-of- 
function mutations in the CDKN2A tumor sup¬ 
pressor gene or by gain-of-function mutations 
in the CDK4 proto-oncogene. Both mutations 
result in a loss of cell cycle control via the 
pRb and p53 pathways. Somatic mutations of 
CDKN2A, BRAF, or NRAS are seen in most 
sporadic melanomas. 

The /7fr Proto-Oncogene and Multiple 
Endocrine Neoplasia 

The RET proto-oncogene, initially identified by a transfection 
assay (rearranged during transfection; see previous discus¬ 
sion), encodes a receptor tyrosine kinase that includes an 
extracellular receptor domain, a transmembrane domain, 
and an intracellular tyrosine kinase domain. RET is involved 
in embryonic neural crest cell migration (see Chapter 10), 
and it is normally activated by a complex consisting of glial- 
derived neurotrophic factor (GDNF) and a coreceptor termed 
GFRa. The RET protein interacts with several signal trans¬ 
duction pathways, including the well-known RAS pathway. 

Inherited loss-of-function mutations in RET can produce 
Hirschsprung disease (a lack of enteric nerve cells, resulting 
in severe, chronic constipation and bowel distention). Gain- 
of-function mutations in the same gene result in excess tyro¬ 
sine kinase activity and increased signal transduction, leading 
ultimately to cellular proliferation and, depending on the 
type and location of the mutation, any of three forms of 


autosomal dominant multiple endocrine neoplasia type 2 
(MEN2): (1) MEN2A, which accounts for 80% of MEN2 
cases and is characterized by medullary thyroid carcinomas 
(MTG) in nearly 100% of patients, parathyroid hyperplasia 
in 30% of patients, and pheochromocytoma (an adrenal 
tumor) in 50% of patients. More than 98% of MEN2A cases 
are caused by missense mutations that affect cysteine residues 
in RET’s extracellular domain. (2) MEN2B is similar to 
MEN2A but lacks parathyroid hyperplasia and includes mul¬ 
tiple mucosal neuromas and a marfanoid appearance. Nearly 
all MEN2B alterations are missense mutations that affect 
RET’s tyrosine kinase domain. MEN2B accounts for about 
5% of MEN2 cases and is the most aggressive form of MEN2. 
(3) A syndrome consisting only of familial MTG can be 
caused by mutations in both the extracellular and tyrosine 
kinase domains of RET. 

RET is one of only a few proto-oncogenes in which muta¬ 
tions can cause inherited cancer syndromes (see Table 11-3 
for other examples)."^ Identification of the mutations respon¬ 
sible for each of these inherited cancer syndromes has pro¬ 
vided an accurate means of early diagnosis. Patients with 
apparently sporadic MTG are advised to undergo genetic 
testing for RET mutations because 1% to 7% of these patients 
have germline RET mutations and thus have familial, rather 
than sporadic, MTG. Prophylactic thyroidectomy before 6 
years of age is recommended for children who inherit a 
disease-causing mutation (thyroidectomy before age 3 years 
may be indicated for the more aggressive MEN2B tumors). 

Somatic alterations of RET can produce papillary thyroid 
carcinomas, the most common type of thyroid tumor. The 
prevalence of this tumor increased substantially among 
persons who were exposed to radioactive fallout from the 
Chernobyl nuclear reactor accident. Sixty percent of the pap¬ 
illary thyroid carcinomas in these persons have contained 
somatic RET alterations. 

The RET gene provides an example of extraordinary allelic 
heterogeneity. Loss-of-function mutations in this gene 
produce defects in the embryonic development of the bowel, 
and gain-of-function mutations result in increased signal 
transduction and various forms of endocrine neoplasia. This 
example illustrates the critical connection between normal 
development and cancer: both involve a finely tuned genetic 
regulation of cell growth and differentiation. 

Loss-of-function mutations in the RET proto¬ 
oncogene can produce Hirschsprung disease, 
a disorder of embryonic development. Germ¬ 
line gain-of-function mutations in the same 
gene can lead to any of three different types 
of inherited multiple endocrine neoplasia. 
Somatic alterations in RET can produce nonin- 
herited papillary thyroid carcinoma. 


^Another inherited form of multiple endocrine neoplasia, MENl, is 
characterized by tumors of the parathyroids, anterior pituitary, and 
pancreas. MENl is caused by germline mutations in a gene that 
encodes the menin tumor suppressor. 
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Many additional genes responsible for various inherited 
cancer syndromes have been identified. These include, for 
example, the genes that cause neurofibromatosis type 2, von 
Hippel-Lindau syndrome, and Beckwith-Wiedemann syn¬ 
drome (see Table 11-1). With the current generation of 
genomic resources, including high-throughput DNA sequenc¬ 
ing, it is reasonable to expect the identification of more such 
genes. 

IS GENETIC INHERITANCE IMPORTANT IN 
COMMON CANCERS? 

The term “common cancers” is often used to designate those 
cancers, such as carcinomas of the breast, colon, or prostate, 
that are not usually part of an inherited cancer syndrome 
(e.g., Li-Fraumeni syndrome or familial adenomatous pol¬ 
yposis). Recall that germline mutations in genes such as 
BRCAl or APC, although very important in aiding our 
understanding of the basis of carcinogenesis, are responsible 


for only a small proportion of breast or colon cancer cases. 
Yet these cancers do cluster in families. Typically, the presence 
of one affected first-degree relative increases one’s risk of 
developing a common cancer by a factor of two or more. It 
is likely that additional genes, as well as nongenetic factors 
that are shared in families, contribute to this increased famil¬ 
ial risk (see Chapter 12 for further discussion of these genetic 
and nongenetic factors in common cancers). 

Genome-wide association studies, as well as exome and 
whole-genome sequencing studies, have revealed many 
common cancer-predisposing alleles, each with a relatively 
small effect on overall cancer risk (an example was given by 
an allele of the FGFR2 gene in breast cancer). It is critical to 
determine how these alleles contribute functionally to carci¬ 
nogenesis. The continuing identification of these alleles, and 
characterization of the genes in which they occur, can be 
expected to create new diagnostic and therapeutic tools that 
ultimately will reduce the cancer burden. 


I STUDY QUESTIONS 


1. The G6PD locus is located on the X chromosome. Studies 
of G6PD alleles in tumor cells from women show that all 
tumor cells usually express the same single G6PD allele, 
even though the women are heterozygous at the G6PD 
locus. What does this finding imply about the origin of the 
tumor cells? 

2. If we assume that the somatic mutation rate at the RBI 
locus is three mutations per million cells and that there is 
a population of 2 mhlion retinoblasts per individual, what 
is the expected frequency of sporadic retinoblastoma in the 
population? What is the expected number of tumors per 


individual among those who inherit a mutated copy of the 
RBI gene? 

3. Compare and contrast oncogenes and tumor suppressor 
genes. How have the characteristics of these classes of 
cancer-causing genes affected our ability to detect them? 

4. Members of Li-Fraumeni syndrome families nearly always 
develop tumors by 65 to 70 years of age, but mutation 
carriers do not necessarily develop the same type of tumor 
(e.g., one has breast cancer, and another has colon cancer). 
Explain this. 
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Multifactorial Inheritance and 
Common Diseases 


The focus of previous chapters has been on diseases that are 
caused primarily by mutations in single genes or by abnor¬ 
malities of single chromosomes. Much progress has been 
made in identifying specific mutations that cause these dis¬ 
eases, leading to better risk estimates and in some cases more 
effective treatment. However, these conditions form only a 
small portion of the total burden of human genetic disease. 
A much larger component of our disease burden is composed 
of congenital malformations and common adult diseases, 
such as cancer, heart disease, and diabetes. Although they are 
usually not the result of single-gene mutations or chromo¬ 
some abnormalities, these diseases have significant genetic 
components. They are the result of a complex interplay of 
multiple genetic and environmental factors. Because of the 
significance of common diseases in health care, it is vital to 
understand the ways in which genes contribute to causing 
them. 

PRINCIPLES OF MULTIFACTORIAL 
INHERITANCE _ 

The Multifactorial Model 

Traits in which variation is thought to be caused by the com¬ 
bined effects of multiple genes are called polygenic (“many 
genes”). When environmental factors are also believed to 
cause variation in the trait, which is usually the case, the term 
multifactorial is used. Many quantitative traits (those, such 
as blood pressure, that are measured on a continuous numer¬ 
ical scale) are multifactorial. Because they are caused by the 
additive effects of many genetic and environmental factors, 
these traits tend to follow a normal, or bell-shaped, distribu¬ 
tion in populations. 

Let us use an example to illustrate this concept. To begin 
with the simplest case, suppose (unrealistically) that height is 
determined by a single gene with two alleles, A and a. Allele 
A tends to make people tall, and allele a tends to make them 
short. If there is no dominance at this locus, then the three 
possible genotypes, AA, Aa, and aa, will produce three phe¬ 
notypes: tall, intermediate, and short. Assume that the allele 
frequencies of A and a are each 0.50. If we assemble a popula¬ 
tion of individuals, we will observe the height distribution 
depicted in Figure 12-lA. 

Now suppose, a bit more realistically, that height is deter¬ 
mined by two loci instead of one. The second locus also has 
two alleles, B (tall) and b (short), and they affect height in 
exactly the same way as alleles A and a do. There are now nine 


possible genotypes in our population: aabb, aaBb, aaBB, 
Aabb, AaBb, AaBB, AAbb, AABb, and AABB. Because an indi¬ 
vidual might have zero, one, two, three, or four “tall” alleles, 
there are now five distinct phenotypes (see Fig. 12-lB). 
Although the height distribution in our population is not yet 
normal, it approaches a normal distribution more closely 
than in the single-gene case. 

We now extend our example so that many genes and envi¬ 
ronmental factors influence height, each having a small effect. 
Then there are many possible phenotypes, each differing 
slightly, and the height distribution approaches the bell¬ 
shaped curve shown in Figure 12-lC. Genome-wide associa¬ 
tion studies (GWAS, see Chapter 8) have identified more than 
200 loci associated with human height, affirming that this is 
indeed a polygenic, multifactorial trait. The loci underlying 
variation in a quantitative trait such as height are termed 
quantitative trait loci. 

It should be emphasized that the individual genes underly¬ 
ing a multifactorial trait such as height follow the mendelian 
principles of segregation and independent assortment, just 
like any other genes. The only difference is that many of them 
act together to influence the trait. 

Blood pressure is another example of a multifactorial trait. 
There is a correlation between parents’ blood pressures (sys¬ 
tolic and diastolic) and those of their children, and there is 
good evidence that this correlation is due in part to genes. 
But blood pressure is also influenced by environmental 
factors, such as diet and stress. One of the goals of genetic 
research is identification of the genes responsible for multi¬ 
factorial traits such as blood pressure and of the interactions 
of those genes with environmental factors. 

Many traits are thought to be influenced by 
multiple genes as well as by environmental 
factors. These traits are said to be multifacto¬ 
rial. When they can be measured on a con¬ 
tinuous scale, they often follow a normal 
distribution. 

The Threshold Model 

A number of diseases do not follow the bell-shaped distribu¬ 
tion. Instead, they appear to be either present or absent in 
individuals. Yet they do not follow the patterns expected of 
single-gene diseases. A commonly used explanation is that 
there is an underlying liability distribution for these diseases 
in a population (Fig. 12-2). Persons who are on the low end 
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Genotypes 


A Short Tall 



Aabb AaBb AABb 
AAbb 

Genotypes 


B Short Tall 



FIG 12-1 A, The distribution of height in a population, assum¬ 
ing that height is controlled by a single locus with genotypes 
AA, Aa, and aa. B, The distribution of height, assuming that 
height is controlled by two loci. There are now five distinct 
phenotypes instead of three, and the distribution begins to 
look more like the normal distribution. C, Distribution of 
height, assuming that multiple factors, each with a small 
effect, contribute to the trait (the multifactorial model). 


of the distribution have little chance of developing the disease 
in question (i.e., they have few of the alleles or environmental 
factors that would cause the disease). Those who are closer to 
the high end of the distribution have more of the disease- 
causing alleles and environmental factors and are more likely 
to develop the disease. For multifactorial diseases that are 
either present or absent, a threshold of liability must be 
crossed before the disease is expressed. Below the threshold, 



Liability 



FIG 12-2 A liability distribution for a multifactorial disease in 
a population. To be affected with the disease, a person must 
exceed the threshold on the liability distribution. This figure 
shows two thresholds, a lower one for males and a higher 
one for females (as in pyloric stenosis). 


the person appears unaffected; above it, he or she is affected 
by the disease. 

A disease that is thought to correspond to this threshold 
model is pyloric stenosis, a disorder that manifests shortly 
after birth and is caused by a narrowing or obstruction of the 
pylorus, the area between the stomach and intestine. Chronic 
vomiting, constipation, weight loss, and electrolyte imbalance 
result from the condition, which can be corrected by surgery 
or sometimes resolves spontaneously. The prevalence of 
pyloric stenosis among whites is about 3/1000 live births. It 
is much more common in males than in females, affecting 
1/200 males and 1/1000 females. It is thought that this differ¬ 
ence in prevalence reflects two thresholds in the liability dis¬ 
tribution; a lower one in males and a higher one in females 
(see Fig. 12-2). A lower male threshold implies that fewer 
disease-causing factors are required to generate the disorder 
in males. 

The liability threshold concept can explain the pattern of 
sibling recurrence risks for pyloric stenosis, shown in Table 
12-1 . Notice that males, having a lower threshold, always have 
a higher risk than females. However, the recurrence risk also 
depends on the sex of the proband. It is higher when the 
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TABLE 12-1 Recurrence Risks (%) for 
Pyloric Stenosis, Subdivided by Gender of 
Affected Probands and Relatives 


FEMALE 

MALE PROBANDS PROBANDS 
RELATIVES LONDON BELFAST LONDON BELFAST 

Brothers 3.8 9.6 9.2 12.5 

Sisters 2.7 3.0 3.8 3.8 

Adapted from Carter CO. Genetics of common single 
malformations. Br Med Bull. 1976:32:21-26. 

*Note that the risks differ somewhat between the two 
populations. 


proband is female than when the proband is male. This 
reflects the concept that females, having a higher liability 
threshold, must be exposed to more disease-causing factors 
than males in order to develop the disease. Thus, a family 
with an affected female must have more genetic and environ¬ 
mental risk factors, producing a higher recurrence risk for 
pyloric stenosis in future offspring. In such a situation, we 
would expect that the highest risk category would be male 
relatives oi female probands; Table 12-1 shows that this is 
indeed the case. 

A number of other congenital malformations are thought 
to correspond to this model. They include isolated”^ cleft lip 
and/or cleft palate, neural tube defects (anencephaly and 
spina bifida), club foot (talipes), and some forms of congeni¬ 
tal heart disease. 

The threshold model applies to many multifac¬ 
torial diseases. It assumes that there is an 
underlying liability distribution in a population 
and that a threshold on this distribution must 
be passed before a disease is expressed. 

Recurrence Risks and Transmission Patterns 

Whereas recurrence risks can be given with confidence for 
single-gene diseases (50% for a completely penetrant autoso¬ 
mal dominant disease, 25% for autosomal recessive diseases, 
and so on), risk estimation is more complex for multifactorial 
diseases. This is because the number of genes contributing to 
the disease is usually not known, the precise allelic constitu¬ 
tion of the parents is not known, and the extent of environ¬ 
mental effects can vary substantially. For most multifactorial 
diseases, empirical risks (i.e., risks based on direct observa¬ 
tion of data) have been derived. To estimate empirical risks, 
a large series of families is examined in which one child (the 
proband) has developed the disease. The relatives of each 
proband are surveyed in order to calculate the percentage 
who have also developed the disease. For example, in North 


*In this context, the term “isolated” means that this is the only 
observed disease feature (i.e., the feature is not part of a larger con¬ 
stellation of findings, as in cleft lip/palate secondary to trisomy 13). 


America neural tube defects are seen in about 2% to 3% of 
the siblings of probands with this condition (Clinical Com¬ 
mentary 12-1). Thus, the recurrence risk for parents who 
have had one child with a neural tube defect is 2% to 3%. For 
conditions that are not lethal or severely debilitating, such as 
cleft lip and cleft palate, recurrence risks can also be estimated 
for the offspring of affected parents. Because risk factors vary 
among diseases, empirical recurrence risks are specific for 
each multifactorial disease. 

In contrast to most single-gene diseases, recurrence risks 
for multifactorial diseases can change substantially from one 
population to another (notice the differences between the 
London and Belfast populations in Table 12-1). This is 
because allele frequencies as well as environmental factors 
can differ among populations. 

Empirical recurrence risks for multifactorial 
diseases are based on studies of large collec¬ 
tions of families. These risks are specific to a 
given population. 

It is sometimes difficult to differentiate polygenic or mul¬ 
tifactorial diseases from single-gene diseases that have reduced 
penetrance or variable expression. Large data sets and good 
family history data are necessary to make the distinction. 
Several criteria are usually used to define multifactorial 
inheritance. 

• The recurrence risk is higher if more than one family member 
is affected. For example, the sibling recurrence risk for a 
ventricular septal defect (VSD, a type of congenital heart 
defect) is 3% if one sibling has had a VSD but increases to 
approximately 10% if two siblings have had VSDs. In con¬ 
trast, the recurrence risk for single-gene diseases remains 
the same regardless of the number of affected siblings. 
This increase does not mean that the family’s risk has actu¬ 
ally changed. Rather, it means that we now have more 
information about the family’s true risk: because they have 
had two affected children, they are probably located higher 
on the liability distribution than a family with only one 
affected child. In other words, they have more risk factors 
(genetic and/or environmental) and are more likely to 
produce an affected child. 

• If the expression of the disease in the proband is more severe, 
the recurrence risk is higher. This is again consistent with 
the liability model, because a more-severe expression 
indicates that the affected person is at the extreme tail of 
the liability distribution (see Fig. 12-2). His or her rela¬ 
tives are thus at a higher risk to inherit disease-causing 
genes. For example, the occurrence of a bilateral (both 
sides) cleft lip/palate confers a higher recurrence risk on 
family members than does the occurrence of a unilateral 
(one side) cleft. 

• The recurrence risk is higher if the proband is of the less 
commonly affected sex (see, for example, the previous dis¬ 
cussion of pyloric stenosis). This is because an affected 
individual of the less susceptible sex is usually at a more 
extreme position on the liability distribution. 
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CLINICAL COMMENTARY 12-1 
Neural Tube Defects 

Neural tube defects (NTDs) include anencephaly, spina bifida, 
and encephalocele, as well as several other less-common 
forms (Fig. 12-3). They are one of the most important classes 
of birth defects, with a newborn prevalence of approximately 
1 per 1000. There is considerable variation in the prevalence 
of NTDs among populations, with an especially high rate 
among some northern Chinese populations (as high as 6 per 
1000 births). The prevalence of NTDs has been decreasing in 
many parts of the United States and Europe during the past 
three decades. 

Normally the neural tube closes at about the fourth week of 
gestation. A defect in closure or a subsequent reopening of the 
neural tube results in an NTD. Spina bifida is the most com¬ 
monly observed NTD and consists of a protrusion of spinal 
tissue through the vertebral column (the tissue usually includes 
meninges, spinal cord, and nerve roots). About 75% of spina 
bifida patients have secondary hydrocephalus, which some¬ 
times in turn produces intellectual disability. Paralysis or muscle 
weakness, lack of sphincter control, and club feet are often 
observed. A study conducted in British Columbia showed that 
survival rates for spina bifida patients have improved dramati¬ 
cally over the past several decades. Less than 30% of such 
children born between 1952 and 1969 survived to 10 years of 
age, but 65% of those born between 1970 and 1986 survived 
to this age. 

Anencephaly is characterized by partial or complete absence 
of the cranial vault and calvarium and partial or complete 
absence of the cerebral hemispheres. At least two thirds of 
anencephalics are stillborn; term deliveries do not survive more 
than a few hours or days. Encephalocele consists of a protru¬ 
sion of the brain into an enclosed sac. It is seldom compatible 
with survival. 

NTDs are thought to arise from a combination of genetic and 
environmental factors. In most populations surveyed thus far, 
empirical recurrence risks for siblings of affected individuals 
range from 2% to 5%. Consistent with a multifactorial model, 
the recurrence risk increases with additional affected siblings. 
A Hungarian study showed that the overall prevalence of NTDs 
in that country was 1/300 births and that the sibling recurrence 
risks were 3%, 12%, and 25% after one, two, and three 
affected offspring, respectively. Recurrence risks tend to be 
slightly lower in populations with lower NTD prevalence rates, 
as predicted by the multifactorial model. Recurrence risk data 
support the idea that the major forms of NTDs are caused by 
shared or similar factors. An anencephalic conception increases 
the recurrence risk for subsequent spina bifida conceptions, 
and vice versa. 

NTDs can usually be diagnosed prenatally, sometimes by 
ultrasound and usually by an elevation in a-fetoprotein (AFP) 
in the maternal serum or amniotic fluid (see Chapter 13). A 
spina bifida lesion can be either open or closed (i.e., covered 
with a layer of skin). Open spina bifida is more likely to be 
detected by an AFP assay. 

A major epidemiological finding is that mothers who supple¬ 
ment their diet with folic acid at the time of conception are less 
likely to produce children with NTDs. This result has been 
replicated in several different populations and is thus well con¬ 
firmed. It has been estimated that approximately 50% to 70% 



FIG 12-3 The major neural tube defects. A, An infant with 
an open spina bifida (meningomyelocele). B, A fetus with 
anencephaly. Note the abnormalities of the orbits of the eye 
and the cranial defect. C, An occipital encephalocele. (A and 
B from Jones KL. Smith's Recognizable Patterns of Human 
Malformation. 6th ed. Philadelphia: Saunders; 2006:705.) 
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CLINICAL COMMENTARY 12-1 
Neural Tube Defects—cont'd 

of NTDs can be avoided simply by dietary folic acid supplemen¬ 
tation. (Traditional prenatal vitamin supplements would not 
have an effect because administration does not usually begin 
until well after the time that the neural tube closes.) Since 
mothers would be likely to ingest similar amounts of folic acid 
from one pregnancy to the next, folic acid deficiency could well 
account for at least part of the elevated sibling recurrence risk 
for NTDs. 

Dietary folic acid is an important example of a nongenetic 
factor that contributes to familial clustering of a disease. 


However, it is likely that there is genetic variation in response 
to folic acid, which helps to explain why most mothers with 
folic acid deficiency do not bear children with NTDs and why 
some who Ingest adequate amounts of folic acid nonetheless 
bear children with NTDs. To address this issue, researchers are 
testing for associations between NTDs and variants in several 
genes whose products (e.g., methylene tetrahydrofolate reduc¬ 
tase) are Involved In folic acid metabolism (see Clinical Com¬ 
mentary 15-6 in Chapter 15 for further information on dietary 
folic acid supplementation and NTD prevention). 


TABLE 12-2 Recurrence Risks for First-, Second-, and Third-Degree Relatives of Probands 


DISEASE 

PREVALENCE IN 

GENERAL POPULATION 

FIRST DEGREE 

DEGREE OF RELATION 

SECOND DEGREE 

THIRD DEGREE 

Cleft lip/palate 

0.001 

0.04 

0.007 

0.003 

Club foot 

0.001 

0.025 

0.005 

0.002 

Congenital hip dislocation 

0.002 

0.005 

0.006 

0.004 


• The recurrence risk for the disease usually decreases rapidly 
in more remotely related relatives (Table 12-2). Although 
the recurrence risk for single-gene diseases decreases by 
50% with each degree of relationship (e.g., an autosomal 
dominant disease has a 50% recurrence risk for offspring 
of affected persons, 25% for nieces or nephews, 12.5% 
for first cousins, and so on), it decreases much more 
quickly for multifactorial diseases. This reflects the fact 
that many genetic and environmental factors must 
combine to produce a trait. All of the necessary risk 
factors are unlikely to be present in less closely related 
family members. 

• If the prevalence of the disease in a population is f (which 
varies between zero and one), the risl^or offspring and sib¬ 
lings of probands is approximately vf. This does not hold 
true for single-gene traits, because their recurrence risks 
are independent of population prevalence. It is not an 
absolute rule for multifactorial traits either, but many such 
diseases do tend to conform to this prediction. Examina¬ 
tion of the risks given in Table 12-2 shows that these three 
diseases follow the prediction fairly well. 

Risks for multifactorial diseases usually 
increase if more family members are affected, 
if the disease has more severe expression, and 
if the affected proband is a member of the 
less commonly affected sex. Recurrence risks 
decrease rapidly with more-remote degrees of 
relationship. In general, the sibling recurrence 
risk is approximately equal to the square 
root of the prevalence of the disease in the 
population. 


Multifactorial versus Single-Gene Inheritance 

It is important to clarify the difference between a multifacto¬ 
rial disease and a single-gene disease in which there is locus 
heterogeneity. In the former case, a disease is caused by the 
simultaneous influence of multiple genetic and environmen¬ 
tal factors, each of which has a relatively small effect. In con¬ 
trast, a disease with locus heterogeneity, such as osteogenesis 
imperfecta, requires only a single mutation to cause it. 
Because of locus heterogeneity, a single mutation at either of 
two or more loci can cause disease; some affected persons 
have one mutation while others have the other mutation. 

In some cases, a trait may be influenced by the combina¬ 
tion of both a single gene with large effects and a multifacto¬ 
rial background in which additional genes and environmental 
factors have small individual effects (Fig. 12-4). Imagine that 
variation in height, for example, is caused by a single locus 
(termed a major gene) and a multifactorial component. Indi¬ 
viduals with the AA genotype tend to be taUer, those with the 
aa genotype tend to be shorter, and those with Aa tend to be 
intermediate. But additional variation is caused by other 
factors (the multifactorial component). Thus, those with the 
aa genotype vary in height from 130 cm to about 170 cm, 
those with the Aa genotype vary from 150 cm to 190 cm, and 
those with the AA genotype vary from 170 to 210 cm. There 
is substantial overlap among the three major genotypes 
because of the influence of the multifactorial background. 
The total distribution of height, which is bell-shaped, is 
caused by the superposition of the three distributions about 
each genotype. 

Many of the diseases to be discussed later can be caused 
by a major gene and/or multifactorial inheritance. That is, 
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135 cm 170 cm 205 cm 

FIG 12-4 The distribution of height, assuming the presence 
of a major gene (genotypes AA, Aa, and aa) combined with 
a multifactorial background. The multifactorial background 
causes variation in height among individuals of each geno¬ 
type. If the distributions of each of the three genotypes were 
superimposed, then the overall distribution of height would 
be approximately normal, as shown by the dotted line. 


there are subsets of the population in which diseases such as 
colon cancer, breast cancer, or heart disease are inherited as 
single-gene disorders (with additional variation in disease 
susceptibility contributed by other genetic and environmen¬ 
tal factors). These subsets usually account for only a small 
percentage of the total number of disease cases. It is neverthe¬ 
less important to identify the responsible major genes, 
because their function can provide important clues to the 
pathophysiology and treatment of the disease. 

Multifactorial diseases can be distinguished 
from single-gene disorders caused by muta¬ 
tions at different loci (locus heterogeneity). 
Sometimes a disease has both single-gene 
and multifactorial components. 

NATURE AND NURTURE: DISENTANGLING 
THE EFFECTS OF GENES AND ENVIRONMENT 

Family members share genes and a common environment. 
Family resemblance in traits such as blood pressure therefore 
reflects both genetic and environmental commonality 
(“nature” and “nurture,” respectively). For centuries, people 
have debated the relative importance of these two types of 
factors. It is a mistake, of course, to view them as mutually 
exclusive. Few traits are influenced only by genes or only by 
environment. Most are influenced by both. 

Determining the relative influence of genetic and environ¬ 
mental factors can lead to a better understanding of disease 
etiology. It can also help in the planning of public health 
strategies. A disease in which hereditary influence is relatively 
small, such as lung cancer, may be prevented most effectively 
through emphasis on lifestyle changes (avoidance of tobacco). 
When a disease has a relatively larger hereditary component, 
as in breast cancer, examination of family history should be 
emphasized in addition to lifestyle modification. 

In the following sections, we review two research strategies 
that are often used to estimate the relative influence of genes 



FIG 12-5 Monozygotic twins, showing a striking similarity 
in physical appearance. Both twins developed myopia as 
teenagers. 


and environment: twin studies and adoption studies. We then 
discuss methods that aim to delineate the individual genes 
responsible for multifactorial diseases. 

Twin Studies 

Twins occur with a frequency of about 1/100 births in popula¬ 
tions of European ancestry. They are slightly more common 
among Africans and a bit less common among Asians. Mono¬ 
zygotic (MZ, or identical) twins originate when the develop¬ 
ing embryo divides to form two separate but genetically 
identical embryos. Because they are genetically identical, MZ 
twins are an example of natural clones. Their physical appear¬ 
ances can be strikingly similar (Fig. 12-5). Dizygotic (DZ, or 
fraternal) twins are the result of a double ovulation followed 
by the fertilization of each egg by a different sperm. ' Thus, DZ 
twins are genetically no more similar than siblings. Because 
two different sperm cells are required to fertilize the two eggs, 
it is possible for each DZ twin to have a different father. 

Because MZ twins are genetically identical, any differences 
between them should be due only to environmental effects. 
MZ twins should thus resemble each other very closely for 
traits that are strongly influenced by genes. DZ twins provide 
a convenient comparison: their environmental differences 
should be similar to those of MZ twins, but their genetic dif¬ 
ferences are as great as those between siblings. Twin studies 
thus usually consist of comparisons between MZ and DZ 
twins. If both members of a twin pair share a trait (e.g., cleft 


^WhUe MZ twinning rates are quite constant across populations, DZ 
twinning rates vary somewhat. DZ twinning increases with maternal 
age until about age 40 years, after which the rate declines. The fre¬ 
quency of DZ twinning has increased dramatically in developed 
countries during the past several decades because of the use of 
ovulation-inducing drugs. 
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lip), they are said to be concordant. If they do not share the 
trait, they are discordant. For a trait determined completely 
by genes, MZ twins should always be concordant, and DZ 
twins should be concordant less often. Like siblings, DZ twins 
share only 50% of their DNA because each parent transmits 
half of his or her DNA to each offspring. Concordance rates 
can differ between opposite-sex DZ twin pairs and same-sex 
DZ pairs for some traits, such as those that have different 
frequencies in males and females. For such traits, only same- 
sex DZ twin pairs should be used when comparing MZ and 
DZ concordance rates. 

A concordance estimate would not be appropriate for 
quantitative traits, such as blood pressure or height. Here the 
intraclass correlation coefficient is used. This statistic varies 
between -1.0 and -l-l.O and measures the degree of homoge¬ 
neity of a trait in a sample of individuals. For example, we 
may wish to assess the degree of similarity between twins for 
a trait such as height. The measurements are made in a col¬ 
lection of twins, and correlation coefficients are estimated 
separately for the MZ sample and the DZ sample. If a trait 
were determined entirely by genes, we would expect the cor¬ 
relation coefficient for MZ pairs to be 1.0 (i.e., each pair of 
twins would have exactly the same height). A correlation 
coefficient of 0.0 would mean that the similarity between MZ 
twins for the trait in question is no greater than chance. 
Because DZ twins share half of their DNA, we would expect 


a DZ correlation coefficient of 0.50 for a trait determined 
entirely by genes. 

Monozygotic (identical) twins are the result of 
an early cleavage of the embryo, whereas 
dizygotic (fraternal) twins are caused by the 
fertilization of two eggs by two sperm cells. 
Comparisons of concordance rates and corre¬ 
lations in MZ and DZ twins help to estimate the 
extent to which a trait is influenced by genes. 

Concordance rates and correlation coefficients for a 
number of traits are given in Table 12-3. The concordance 
rates for contagious diseases like measles are quite similar in 
MZ and DZ twins. This is expected, because most contagious 
diseases are unlikely to be influenced markedly by genes. On 
the other hand, the concordance rates for schizophrenia are 
quite dissimilar between MZ and DZ twins, indicating a 
sizable genetic component for this disease. The MZ correla¬ 
tion for dermatoglyphics (fingerprints), a series of traits 
determined almost entirely by genes, is close to 1.0. 

Correlations and concordance rates in MZ and DZ twins 
can be used to measure the heritability of multifactorial 
traits. Essentially, heritability is the percentage of population 
variation in a trait that is due to genes (statistically, it is the 
proportion of the total variance of a trait that is caused by 




DZ, Dizygotic; IQ, intelligence quotient; MZ, monozygotic. 

*These figures were compiled from a large variety of sources and represent primarily European and U.S. populations. Heritability is estimated 
here using the formula 2(cmz - Cdz). 

^Because these are quantitative traits, correlation coefficients are given rather than concordance rates. 

^Several heritability estimates exceed 1.0. Because it is impossible for >100% of the variance of a trait to be genetically determined, these 
values indicate that other factors, such as shared environmental factors, must be operating. 
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genes). A simple formula for estimating heritability (h) from 
twin correlations or concordance rates is as follows: 

_ Cmz ~ Cpz 
1 “ Cdz 

where Cmz is the concordance rate (or intraclass correlation) 
for MZ twins and Cdz is the concordance rate (or intraclass 
correlation) for DZ twins. As this formula illustrates, traits 
that are largely determined by genes result in a heritability 
estimate that approaches 1.0 (i.e., Cmz approaches 1.0, and Cdz 
approaches 0.5). As the difference between MZ and DZ con¬ 
cordance rates becomes smaller, heritability approaches zero. 
Correlations and concordance rates in other types of relatives 
(e.g., between parents and offspring) can also be used to 
measure heritability. 

Like recurrence risks, heritability values are specific for the 
population in which they are estimated. However, there is 
usually agreement from one population to another regarding 
the general range of heritability estimates of most traits (e.g., 
the heritability of height is almost always high, and the heri¬ 
tability of contagious diseases is almost always low). The same 
is true of empirical recurrence risks. 

Comparisons of correlations and concordance 
rates in MZ and DZ twins allow the estimation 
of heritability, a measure of the percentage of 
population variation in a disease that can be 
attributed to genes. 

At one time, twins were thought to provide a perfect 
“natural laboratory” in which to determine the relative influ¬ 
ences of genetics and environment. But several difficulties 
arise. One of the most important is the assumption that the 
environments of MZ and DZ twins are equally similar. MZ 
twins are often treated more similarly than DZ twins. A 
greater similarity in environment can make MZ twins more 
concordant for a trait, inflating the apparent influence of 
genes. In addition, MZ twins may be more likely to seek the 
same type of environment, further reinforcing environmental 
similarity. On the other hand, it has been suggested that some 
MZ twins tend to develop personality differences in an 
attempt to assert their individuality. 

Another difficulty is that the uterine environments of dif¬ 
ferent pairs of MZ twins can be more or less similar, depend¬ 
ing on whether there are two amnions and two chorions, two 
amnions and one shared chorion, or one shared amnion and 
one shared chorion. In addition, somatic mutations can occur 
during mitotic divisions of the cells of MZ twin embryos after 
cleavage occurs. Thus, the MZ twins might not be quite 
“identical,” especially if a mutation occurred early in the 
development of one of the twins. Finally, methylation pat¬ 
terns, which can influence the transcription of specific genes, 
become more dissimilar in MZ twin pairs as they age. This 
dissimilarity is greater when the twins adopt markedly differ¬ 
ent habits and lifestyles (e.g., when one twin smokes ciga¬ 
rettes and the other does not). 


Of the various problems with the twin method, the greater 
degree of environmental sharing among MZ twins is perhaps 
the most serious. One way to circumvent this problem, at 
least in part, is to study MZ twins who were raised in separate 
environments. Concordance among these twin pairs should 
be caused by genetic, rather than environmental, similarities. 
As one might expect, it is not easy to find such twin pairs. A 
major effort to do so has been undertaken by researchers at 
the University of Minnesota, whose studies have shown a 
remarkable congruence among MZ twins reared apart, even 
for many behavioral traits. However, these studies must be 
viewed with caution, because the sample sizes are relatively 
small and because many of the twin pairs had at least some 
contact with each other before they were studied. 

Although twin studies provide valuable infor¬ 
mation, they are also affected by certain biases. 
The most serious is greater environmental 
similarity between MZ twins than between DZ 
twins. Other biases include somatic mutations 
that might affect only one MZ twin and differ¬ 
ences in the uterine environments of twins. 

Adoption Studies 

Studies of adopted children are also used to estimate the 
genetic contribution to a multifactorial trait. Offspring who 
were born to parents who have a disease but who were 
adopted by parents lacking the disease can be studied to find 
out whether the offspring develop the disease. In some cases, 
these adopted persons develop the disease more often than 
do children in a comparative control population (i.e., adopted 
children who were born to parents who do not have the 
disease). This provides evidence that genes may be involved 
in causing the disease, because the adopted children do not 
share an environment with their affected natural parents. For 
example, schizophrenia is seen in 8% to 10% of adopted 
children whose natural parent had schizophrenia, whereas it 
is seen in only 1% of adopted children of unaffected parents. 

As with twin studies, several precautions must be exercised 
in interpreting the results of adoption studies. First, prenatal 
environmental influences could have long-lasting effects on 
an adopted child. Second, children are sometimes adopted 
after they are several years old, ensuring that some nongenetic 
influences have been imparted by the natural parents. Finally, 
adoption agencies sometimes try to match the adoptive 
parents with the natural parents in terms of attributes such 
as socioeconomic status. All of these factors could exaggerate 
the apparent influence of biological inheritance. 

Adoption studies provide a second means of 
estimating the influence of genes on multifac¬ 
torial diseases. They consist of comparing 
disease rates among the adopted offspring of 
affected parents with the rates among adopted 
offspring of unaffected parents. As with the 
twin method, several biases can influence 
these studies. 
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These reservations, as well as those summarized for 
twin studies, underscore the need for caution in basing con¬ 
clusions on twin and adoption studies. These approaches do 
not provide definitive measures of the role of genes in mul¬ 
tifactorial disease, nor can they identify specific genes respon¬ 
sible for disease. Instead, they provide a preliminary indication 
of the extent to which a multifactorial disease may be influ¬ 
enced by genetic factors. Methods for the direct detection 
of genes underlying multifactorial traits are summarized in 
Box 12-1. 

THE GENETICS OF COMMON DISEASES 

Having discussed the principles of multifactorial inheritance, 
we turn next to a discussion of the common multifactorial 
disorders themselves. Some of these disorders, the congenital 
malformations, are by definition present at birth. Others, 
including heart disease, cancer, diabetes, and most psychiatric 
disorders, are seen primarily in adolescents and adults. 
Because of their complexity, unraveling the genetics of these 
disorders is a daunting task. Nonetheless, significant progress 
is now being made. 

Congenital Malformations 

Approximately 2% of newborns present with a congenital 
malformation (i.e., one that is present at birth); most of these 
conditions are considered to be multifactorial in etiology. 
Some of the more common congenital malformations are 
listed in Table 12-4. In general, sibling recurrence risks for 
most of these disorders range from 1% to 5%. 

Some congenital malformations, such as cleft lip/palate 
and pyloric stenosis, are relatively easy to repair and thus do 
not cause lasting problems. Others, such as neural tube 
defects, usually have more severe consequences. Although 
some cases of congenital malformations can occur in the 
absence of any other problems, it is quite common for them 
to be associated with other disorders. For example, cleft lip/ 
palate is often seen in babies with trisomy 13, and congenital 
heart defects are seen in many syndromes, including trisomy 
of chromosomes 13, 18, and 21. 

Considerable progress is now being made in isolating 
single genes that can cause congenital malformations. Many 


TABLE 12-4 Prevalence Rates of 

Common Congenital Malformations in 
Persons of European Descent 

DISORDER 

APPROXIMATE PREVALENCE 
PER 1000 BIRTHS 

Cleft lip/palate 

1.0 

Club foot 

1.0 

Congenital heart defects 4.0-8.0 

Hydrocephaly 

0.5-2.5 

Isolated cleft palate 

0.4 

Neural tube defects 

1.0-3.0 

Pyloric stenosis 

3.0 


of these, including the HOX, PAX, and TBX families of genes, 
were discussed in Chapter 10. Another example is the RET 
proto-oncogene, which is responsible for some cases of 
Hirschsprung disease. However, the causes of many cases of 
this disorder remain undiscovered. Indeed, most of the 
genetic factors that contribute to important congenital mal¬ 
formations (e.g., neural tube defects, common congenital 
heart defects, cleft lip/palate) are as yet unidentified. 

Environmental factors have also been shown to cause 
some congenital malformations. An example is thalidomide, 
a sedative used during pregnancy in the early 1960s (and 
recently reintroduced for the treatment of dermatological 
conditions such as leprosy). When ingested during early preg¬ 
nancy, this drug often can cause phocomelia (severely short¬ 
ened limbs) in babies. Maternal exposure to retinoic acid, 
which is used to treat acne, can cause congenital defects of 
the heart, ear, and central nervous system. Maternal rubella 
infection can cause congenital heart defects. Other environ¬ 
mental factors that can cause congenital malformations are 
discussed in Chapter 15. 

Congenital malformations are seen in roughly 
1 of every 50 live births. Most of them are 
considered to be multifactorial disorders. Spe¬ 
cific genes and environmental causes have 
been detected for some congenital malforma¬ 
tions, but the causes of most congenital mal¬ 
formations remain largely unknown. 

Multifactorial Disorders in the Adult Population 

Until recently, very little was known about specific genes 
responsible for common adult diseases. With more powerful 
laboratory and analytical techniques, this situation is chang¬ 
ing. We next review recent progress in understanding the 
genetics of the major common adult diseases. 

Cardiovascular Disorders 

Heart disease. Heart disease is the leading cause of death 
worldwide, and it accounts for approximately 25% of all 
deaths in the United States. The most common underlying 
cause of heart disease is coronary artery disease (CAD), 
which is caused by atherosclerosis (a narrowing of the coro¬ 
nary arteries resulting from the formation of lipid-laden 
lesions). This narrowing impedes blood flow to the heart and 
can eventually result in a myocardial infarction (death of 
heart tissue caused by an inadequate supply of oxygen). When 
atherosclerosis occurs in arteries that supply blood to the 
brain, a stroke can result. A number of risk factors for CAD 
have been identified, including obesity, cigarette smoking, 
hypertension, elevated cholesterol level, and positive family 
history (usually defined as having one or more affected first- 
degree relatives). Many studies have examined the role of 
family history in CAD, and they show that a person with a 
positive family history is at least twice as likely to suffer from 
CAD than is a person with no family history. Generally, these 
studies also show that the risk is higher if there are more 
affected relatives, if the affected relative is female (the less 
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BOX 12-1 Finding Genes That Contribute to Multifactorial Disease 


As mentioned in the text, twin and adoption studies are not 
designed to reveal specific genes that cause multifactorial 
diseases. The identification of specific causative genes is an 
important goal, because only then can we begin to understand 
the underlying biology of the disease and undertake to correct 
the defect. For complex multifactorlal traits, this is a formidable 
task because of locus heterogeneity, the interactions of mul¬ 
tiple genes, decreased penetrance, age-dependent onset, and 
phenocopies (persons who have a phenotype, such as breast 
cancer, but who do not carry a known disease-causing muta¬ 
tion, such as a BRCA1 alteration). Fortunately, recent techno¬ 
logical advances promise to make this goal more attainable. 
Here, we discuss several approaches that are used to Identify 
the genes underlying multifactorial traits. 

One way to search for these genes is to use conventional 
linkage analysis, as described in Chapter 8. Disease families 
are collected, a single-gene mode of inheritance is assumed, 
and linkage analysis is undertaken with a large series of 
marker polymorphisms that span the genome (this is termed 
a genome scan). If a sufficiently large LOD score (see 
Chapter 8) is obtained with a polymorphism, it is assumed 
that the region around this polymorphism might contain a 
disease-causing gene. This approach is sometimes success¬ 
ful, especially when there are subsets of families in which a 
single-gene mode of Inheritance is seen (e.g., autosomal dom¬ 
inant, autosomal recessive). This was the case, for example, 
with familial breast cancer and familial Alzheimer disease, 
where some families presented a clear autosomal dominant 
mode of inheritance. 

With many multifactorial disorders, however, such subsets 
are not readily apparent. Because of obstacles such as hetero¬ 
geneity and phenocopies, traditional linkage analyses are often 
Impractical. One alternative to traditional linkage analysis is the 
affected sib-pair method. The logic of this approach is simple: 
If two siblings are both affected by a genetic disease, we would 
expect to see increased sharing of marker alleles In the 
genomic region that contains a susceptibility gene. To conduct 
an analysis using this approach, we begin by collecting DMA 
samples from a large number of sib pairs in which both 
members of the pair are affected by the disease. Then a 
genome scan is undertaken, and the proportion of affected sib 
pairs who share the same allele Is estimated for each polymor¬ 
phism. Because siblings share half their DNA (see Chapter 4), 
we would expect this proportion to be 50% for marker poly¬ 
morphisms that are not linked to a disease-susceptibility locus. 
However, if we find that siblings share the same allele for a 
marker polymorphism more than half the time (say, 75% of the 
time), this would be evidence that the marker Is linked to a 
susceptibility locus. This approach was used, for example, to 
show that the genes In the HLA region contribute to suscepti¬ 
bility for type 1 diabetes. 

The affected sib-pair method has the advantage that one 
does not have to assume a specific mode of inheritance. In 
addition, the method is unaffected by reduced penetrance, 
because both members of the sib pair must be affected to be 
included In the analysis. It Is especially useful for disorders with 
late age of onset (e.g., prostate cancer), for which it would be 


difficult to assemble multigeneratlonal families from whom 
DNA samples could be taken. Affected sIb-paIr analyses are 
sometimes made more powerful by selecting subjects with 
extreme values of a trait (e.g., sib pairs with very high blood 
pressure) to enrich the sample for genes likely to contribute to 
the trait. A variation on this approach is to sample sib pairs that 
are highly discordant for a trait (e.g., one with very high blood 
pressure and one with very low blood pressure) and then to 
look for markers in which there is less allele sharing than the 
expected 50%. 

During the past decade or so, genome-wide association 
studies (GWAS, see Chapter 8) have become common in 
searches for genes that cause complex diseases. GWAS 
became more practical after the Human Genome Project devel¬ 
oped dense sets of SNPs that span the human genome. It Is 
now common to use microarrays that can assay millions of 
SNPs or high-throughput DNA sequencing strategies in a col¬ 
lection of cases and controls. GWAS are carried out using both 
SNPs as well as larger CNVs (which can contain multiple 
genes). Because the differences in the frequencies of disease- 
causing variants can be quite small, thousands or even tens of 
thousands of cases and controls are often tested In these 
studies. A major challenge with GWAS is that the variants 
detected by this technique typically have only a small effect on 
disease risk (5-20%), and their biologic significance is usually 
difficult to Interpret. Confidence in the plausibility of disease- 
associated genes increases if an appreciable number of the 
genes encode products In the same disease-associated 
pathway (e.g., the folate metabolism pathway for neural tube 
defects or the dopaminergic pathway for schizophrenia). 

As the cost of exome and whole-genome sequencing has 
decreased, it has become increasingly common to compare 
DNA sequences in affected cases and matched controls. As 
discussed in Chapter 8, this approach has the advantage that 
it can detect all variants, including rare alleles, that differ in the 
two groups. Because these alleles are rare, large sample sizes 
must be used to detect their effects. This problem can be 
overcome somewhat by adding the effects of all disease- 
associated alleles at a given locus together (this Is termed a 
burden test). In addition, there has been a resurgence of inter¬ 
est in sequencing families, in which even a rare allele is likely 
to be observed in multiple cases. This approach has been suc¬ 
cessful, for example. In identifying new genes associated with 
late-onset Alzheimer disease. 

In addition to searching for variants in protein-coding DNA, 
there is now considerable interest in identifying non-coding 
regulatory variants that may contribute to complex diseases. 
Indeed, about 80% of the SNPs associated with complex dis¬ 
eases In GWAS are found in non-coding DNA. This motivates 
whole-genome sequencing studies. Our understanding of vari¬ 
ation in regulatory sequences such as enhancers and promot¬ 
ers is limited, however, but initial results (e.g., the association 
between enhancer elements and prostate cancer, discussed in 
the text) suggest that much will be learned by analyzing non¬ 
coding variation in the genome. Studies of epigenetic regula¬ 
tion (e.g., methylation and chromatin modification patterns) are 
also likely to yield new Insights. 
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commonly affected sex) rather than male, and if the age of 
onset in the affected relative is early (before 55 years of age). 
For example, one study showed that men between the ages of 
20 and 39 years had a threefold increase in CAD risk if they 
had one affected first-degree relative. This risk increased to 
13-fold if there were two first-degree relatives affected with 
CAD before 55 years of age. 

What part do genes play in the familial clustering of CAD? 
Because of the key role of lipids in atherosclerosis, many 
studies have focused on the genetic determination of varia¬ 
tion in circulating lipoprotein levels. An important advance 
was the identification of the gene that encodes the low-density 
lipoprotein (LDL) receptor. Heterozygosity for a mutation in 
this gene roughly doubles LDL cholesterol levels and is seen 
in approximately 1 in 500 persons. (This condition, known 
as familial hypercholesterolemia, is described further in Clin¬ 
ical Commentary 12-2.) Mutations in the gene encoding apo- 
lipoprotein B, which are seen in about 1 in 1000 persons, are 
another genetic cause of elevated LDL cholesterol. These 
mutations occur in the portion of the gene that is responsible 
for binding of apolipoprotein B to the LDL receptor, and they 
increase circulating LDL cholesterol levels by 50% to 100%. 
Dozens of other genes involved in lipid metabolism and 
transport have been identified, including those genes that 
encode various apolipoproteins (these are the protein com¬ 
ponents of lipoproteins) (Table 12-5). In addition, several 
genes whose protein products contribute to inflammation 
have been associated with CAD, reflecting the critical role of 
inflammation in generating atherosclerotic plaques. Func¬ 
tional analysis of these genes is leading to increased under¬ 
standing and more effective treatment of CAD. 

Environmental factors, many of which are easily modified, 
are also important causes of CAD. There is abundant 


epidemiological evidence that cigarette smoking and obesity 
increase the risk of CAD, whereas exercise and a diet low in 
saturated fats decrease the risk. Indeed, the approximate 60% 
reduction in age-adjusted mortality due to CAD and stroke 
in the United States since 1950 is usually attributed to a 
decrease in the percentage of adults who smoke cigarettes, 
decreased consumption of saturated fats, improved medical 
care, and increased emphasis on healthy lifestyle factors such 
as exercise. 

Another form of heart disease is cardiomyopathy, an 
abnormality of the heart muscle that leads to inadequate 
cardiac function. Cardiomyopathy is a common cause of 
heart failure, resulting in approximately 10,000 deaths annu¬ 
ally in the United States. Nearly 100 different genes have been 
linked to cardiomyopathy. Hypertrophic cardiomyopathy, 
one major form of the disease, is characterized by thickening 
(hypertrophy) of portions of the left ventricle and is seen in 
as many as 1 in 500 adults. About half of hypertrophic car¬ 
diomyopathy cases are familial and are caused by autosomal 
dominant mutations in any of the multiple genes that encode 
various components of the cardiac sarcomere. The most 
commonly mutated genes are those that encode the (J-myosin 
heavy chain (35% of familial cases), myosin-binding protein 
C (20% of cases), and troponin T (15% of cases). 

In contrast to the hypertrophic form of cardiomyopathy, 
dilated cardiomyopathy, which is seen in about I in 2500 
persons, consists of increased size and impaired contraction 
of the ventricles. The end result is impaired pumping of 
the heart. This disease is familial in about one third of 
affected persons; although autosomal dominant mutations 
are most common, mutations can also be X-linked or mito¬ 
chondrial. The most commonly mutated gene encodes titin, 
a cytoskeletal protein. Other genes that can cause dilated 


TABLE 12-5 Lipoprotein Genes Known to Contribute to Coronary Heart Disease Risk 


CHROMOSOME 


GENE 

LOCATION 

FUNCTION OF PROTEIN PRODUCT 

Apolipoprotein A-l 

11q 

HDL component; LCAT cofactor 

Apolipoprotein A-IV 

11q 

Component of chylomicrons and HDL; may influence HDL metabolism 

Apolipoprotein C-lll 

11q 

Allelic variation associated with hypertriglyceridemia 

Apolipoprotein B 

2p 

Ligand for LDL receptor; involved in formation of VLDL, LDL, IDL, and 
chylomicrons 

Apolipoprotein D 

2p 

HDL component 

Apolipoprotein C-l 

19q 

LCAT activation 

Apolipoprotein C-ll 

19q 

Lipoprotein lipase activation 

Apolipoprotein E 

19q 

Ligand for LDL receptor 

Apolipoprotein A-ll 

Ip 

HDL component 

LDL receptor 

19p 

Uptake of circulating LDL particles 

Lipoprotein(a) 

6q 

Cholesterol transport 

Lipoprotein lipase 

8p 

Hydrolysis of lipoprotein lipids 

Hepatic triglyceride lipase 

15q 

Hydrolysis of lipoprotein lipids 

LCAT 

16q 

Cholesterol esterification 

Cholesterol ester transfer protein 

16q 

Facilitates transfer of cholesterol esters and phospholipids between 
lipoproteins 


Adapted in part from King RA, Rotter Jl, eds. The Genetic Basis of Common Diseases. 2nd ed. New York: Oxford University Press: 2002. 
HDL, High-density lipoprotein; IDL, Intermediate-density lipoprotein; LCAT, lecithin cholesterol acyltransferase; LDL, low-density lipoprotein; 
VLDL, very-low-density lipoprotein. 
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CLINICAL COMMENTARY 12-2 
Familial Hypercholesterolemia 

Autosomal dominant familial hypercholesterolemia (FH) is an 
important cause of heart disease, accounting for approximately 
5% of myocardial infarctions (Mis) in persons younger than 
60 years. FH is one of the most common autosomal dominant 
disorders: in most populations surveyed to date, about 1 in 
500 persons is a heterozygote. Plasma cholesterol levels are 
approximately twice as high as normal (i.e., about 300-400 mg/ 
dL), resulting in substantially accelerated atherosclerosis and 
the occurrence of distinctive cholesterol deposits in skin 
and tendons, called xanthomas (Fig. 12-6). Data compiled from 
multiple studies showed that approximately 75% of men with 
FH developed coronary artery disease, and 50% had a fatal Ml, 
by age 60 years. The corresponding percentages for women 
were lower (45% and 15%, respectively), because women 
generally develop heart disease at a later age than men. 

Consistent with Hardy-Weinberg predictions (see Chapter 4), 
about 1/1,000,000 births is homozygous for the FH gene. 
Homozygotes are severely affected, with cholesterol levels 
ranging from 600 to 1200 mg/dL. Most homozygotes experi¬ 
ence Mis before 20 years of age, and an Ml at 18 months of 
age has been reported. Without treatment, most FH homozy¬ 
gotes die before the age of 30 years. 

All cells require cholesterol as a component of their plasma 
membrane. They can either synthesize their own cholesterol, 
or preferentially they obtain it from the extracellular environ¬ 
ment, where it is carried primarily by low-density lipoprotein 
(LDL). In a process known as endocytosis, LDL-bound choles¬ 
terol is taken into the cell via LDL receptors on the cell's 
surface. FH is caused by a reduction in the number of func¬ 
tional LDL receptors on cell surfaces. Cellular cholesterol 
uptake is reduced, and circulating cholesterol levels increase. 

Much of what we know about endocytosis has been learned 
through the study of LDL receptors. The process of endocyto¬ 
sis and the processing of LDL in the cell is described in detail 
in Figure 12-7. These processes result in a fine-tuned regulation 
of cholesterol levels within cells, and they influence the level 
of circulating cholesterol as well. 



FIG 12-6 Xanthomas, seen here on the knuckles, are often 
seen in patients with familial hypercholesterolemia. 


The identification of the LDL receptor gene (LDLR) in 1984 
was a critical step in understanding exactly how LDL receptor 
defects cause FH. More than a thousand different mutations, 
two thirds of which are missense and nonsense substitutions, 
have been identified. Most of the remaining mutations are 
insertions and deletions, many of which arise from unequal 
crossovers (see Chapters 5 and 6) that occur between Alu 
repeat sequences (see Chapter 2) scattered throughout the 
gene. The LDLR mutations can be grouped into five broad 
classes, according to their effects on the activity of the 
receptor: 

• Class I mutations in LDLR result in no detectable protein 
product. Thus, heterozygotes would produce only half the 
normal number of LDL receptors. 

• Class II mutations result in production of the LDL receptor, 
but it is altered to the extent that it cannot leave the endo¬ 
plasmic reticulum. It is eventually degraded. 

• Class III mutations produce an LDL receptor that is capable 
of migrating to the cell surface but is incapable of normal 
binding to LDL. 

• Class IV mutations, which are comparatively rare, produce 
receptors that are normal except that they do not migrate 
specifically to coated pits and thus cannot carry LDL into the 
cell. 

• Class V mutations produce an LDL receptor that cannot 
disassociate from the LDL particle after entry into the cell. 
The receptor cannot return to the cell surface and is degraded. 
Each class of mutations reduces the number of effective LDL 

receptors, resulting in decreased LDL uptake and hence ele¬ 
vated levels of circulating cholesterol. The number of effective 
receptors is reduced by about half in FH heterozygotes, and 
homozygotes have virtually no functional LDL receptors. 

Understanding the defects that lead to FH has helped in the 
development of effective therapies for the disorder. Dietary 
reduction of cholesterol (primarily through the reduced intake 
of saturated fats) has only modest effects on cholesterol levels 
in FH heterozygotes. Because cholesterol is reabsorbed into 
the gut and then recycled through the liver (where most cho¬ 
lesterol synthesis takes place), serum cholesterol levels can be 
reduced by the administration of bile-acid absorbing resins, 
such as cholestyramine. The absorbed cholesterol is excreted. 
It is interesting that reduced recirculation from the gut causes 
the liver cells to form additional LDL receptors, lowering circu¬ 
lating cholesterol levels. However, the decrease in intracellular 
cholesterol also stimulates cholesterol synthesis by liver cells, 
so the overall reduction in plasma LDL is only about 15% to 
20%. This treatment is much more effective when combined 
with one of the statin drugs (e.g., lovastatin, pravastatin), which 
reduce cholesterol synthesis by inhibiting 3-hydroxy-3- 
methylglutaryi coenzyme A (HMG-CoA) reductase. Decreased 
synthesis leads to further production of LDL receptors. When 
these therapies are used in combination, serum cholesterol 
levels in FH heterozygotes can often be reduced to approxi¬ 
mately normal levels. 

The picture is less encouraging for FH homozygotes. The 
therapies mentioned can enhance cholesterol elimination and 
reduce its synthesis, but they are largely ineffective in homo¬ 
zygotes because these persons have few or no LDL receptors. 







CLINICAL COMMENTARY 12-2 
Familial Hypercholesterolemia—cont'd 



FIG 12-7 The process of receptor-mediated endocytosis. 1, The low-density lipoprotein (LDL) 
receptors, which are glycoproteins, are synthesized in the endoplasmic reticulum of the cell. 
2, They pass through the Golgi apparatus to the cell surface, where part of the receptor protrudes 
outside the cell. 3, The circulating LDL particle Is bound by the LDL receptor and localized in 
cell-surface depressions called coated pits (so named because they are coated with a protein 
called clathrin). 4, The coated pit Invaginates, bringing the LDL particle inside the cell. 5, Once 
inside the cell, the LDL particle is separated from the receptor, taken into a lysosome, and broken 
down into its constituents by lysosomal enzymes. 6, The LDL receptor Is recirculated to the cell 
surface to bind another LDL particle. Each LDL receptor goes through this cycle approximately 
once every 10 minutes, even if it is not occupied by an LDL particle. 7, Free cholesterol is 
released from the lysosome for incorporation into cell membranes or metabolism into bile acids 
or steroids. Excess cholesterol can be stored In the cell as a cholesterol ester or removed from 
the cell by association with high-density lipoprotein (fHDL). 8, As cholesterol levels in the cell 
rise, cellular cholesterol synthesis is reduced by inhibition of the rate-limiting enzyme, ElMG-CoA 
reductase. 9, Rising cholesterol levels also Increase the activity of acyl-coenzyme Axholesterol 
acyltransferase (ACAT), an enzyme that modifies cholesterol for storage as cholesterol esters. 
10, In addition, the number of LDL receptors is decreased by lowering the transcription rate of 
the LDL receptor gene itself. This decreases cholesterol uptake. 


Continued 
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CLINICAL COMMENTARY 12-2 
Familial Hypercholesterolemia—cont'd 

Liver transplants, which provide hepatocytes that have normal 
LDL receptors, have been successful in some cases, but this 
option is often limited by a lack of donors. Plasma exchange, 
carried out every 1 to 2 weeks, in combination with drug 
therapy, can reduce cholesterol levels by about 50%. However, 
this therapy is difficult to continue for long periods. Somatic 
cell gene therapy, in which hepatocytes carrying normal LDL 
receptor genes are introduced into the portal circulation, is now 
being tested (see Chapter 13). It might eventually prove to be 
an effective treatment for FH homozygotes. 

As discussed in the main text, FH can also be caused by 
inherited mutations in the gene that encodes apolipoprotein B. 
In addition, a small number of FH cases are caused by muta¬ 
tions in the gene that encodes PCSK9 (proprotein convertase 
subtilisin/kexin type 9), an enzyme that plays a key role in 
degrading LDL receptors. Gain-of-function mutations in the 
PCSK9 gene reduce the number of LDL receptors, causing 
FH. Loss-of-function mutations in this gene can increase 
the number of LDL receptors, resulting in exceptionally low 


circulating LDL levels. These findings have led to the develop¬ 
ment of drugs that inhibit PCSK9 activity, thus lowering LDL 
cholesterol levels. These drugs, which are in late-stage clinical 
trials, can reduce LDL cholesterol levels by approximately 50% 
In the general population of persons with hypercholesterolemia 
and produce significant effects even in those who are using 
statin drugs. 

The FH story illustrates how medical genetics research has 
made important contributions to both the understanding of 
basic cell biology and advancements in clinical therapy. The 
process of receptor-mediated endocytosis, elucidated largely 
by research on the LDL receptor defects, is of fundamental 
significance for cellular processes throughout the body. Equally, 
this research, by clarifying how cholesterol synthesis and 
uptake can be modified, has led to significant improvements 
in therapy for this important cause of heart disease. The dis¬ 
covery of rare mutations in PCSK9 has led to PCSK9 inhibitor 
drugs that may benefit millions of persons with high choles¬ 
terol levels. 


cardiomyopathy encode other cytoskeletal proteins, includ¬ 
ing actin, cardiac troponin T, desmin, and components of the 
dystroglycan-sarcoglycan complex. (Recall from Chapter 5 
that abnormalities of the latter proteins can also cause mus¬ 
cular dystrophies.) 

Mutations have also been identified in more than a dozen 
genes that cause the long QT (LQT) syndrome. LQT, which 
is seen in approximately 1 in 2000 persons, describes the 
characteristically elongated QT interval in the electrocardio¬ 
gram of affected individuals, indicative of delayed cardiac 
repolarization. This disorder, which can be caused either by 
inherited mutations or by exposure to drugs that block potas¬ 
sium channels, predisposes affected persons to potentially 
fatal cardiac arrhythmia. An autosomal dominant form, 
known as Romano-Ward syndrome, can be caused by loss- 
of-function mutations in genes that encode potassium chan¬ 
nels (such as KCNQl, KCNH2, KCNEl, KCNE2, or KCNJ2). 
These mutations delay cardiac repolarization. Gain-of-func- 
tion mutations in several of these same genes have been 
shown to produce a shortened QT interval, as might be 
expected. Romano-Ward syndrome can also be caused by 
gain-of-function mutations in sodium or calcium channel 
genes iSCN5A and CACNAIC, respectively), which result in 
a prolonged depolarizing current. (Other examples of muta¬ 
tions that can cause LQT are given in Table 12-6.) 

An autosomal recessive form of LQT syndrome, known as 
Jervell-Lange-Nielsen syndrome, is less common than the 
Romano-Ward syndrome but is associated with a longer QT 
interval, a higher incidence of sudden cardiac death, and 
sensorineural deafness. This syndrome is caused by muta¬ 
tions in either KCNQl or KCNEl. Because LQT syndrome 
can be difficult to diagnose clinically, genetic testing enables 
more accurate diagnosis of affected persons. In addition, the 
identification of disease-causing genes and their protein 
products is now guiding the development of drug therapy to 


activate the encoded ion channels. Because cardiac arrhyth¬ 
mias account for most of the 300,000 sudden cardiac deaths 
that occur annually in the United States, a better understand¬ 
ing of the genetic defects underlying arrhythmia is of consid¬ 
erable public health significance. 

Heart disease aggregates in families. This 
aggregation is especially strong if there is 
early age of onset and if there are several 
affected relatives. Specific genes have been 
identified for some subsets of families with 
heart disease, and lifestyle changes (exercise, 
diet, avoidance of tobacco) can modify heart 
disease risks appreciably. 

Stroke. Stroke, which refers to brain damage caused by a 
sudden and sustained loss of blood flow to the brain, can 
result from arterial obstruction (ischemic stroke, which 
accounts for 80% of stroke cases) or breakage (hemorrhagic 
stroke). This disease is the fourth leading cause of mortality 
in the United States, accounting for approximately 140,000 
deaths per year. As with heart disease, strokes cluster in fami¬ 
lies: One’s risk of having a stroke increases by two- to three¬ 
fold if a parent has had a stroke. The largest twin study 
conducted to date showed that concordance rates for stroke 
death in MZ and DZ twins were 10% and 5%, respectively. 
These figures imply that genes might influence one’s suscep¬ 
tibility to this disease. 

Stroke can be caused by any of more than a dozen single¬ 
gene disorders, including sickle cell disease (see Chapter 3), 
MELAS (mitochondrial myopathy, encephalopathy, lactic 
acidosis, and stroke, a mitochondrial disorder discussed in 
Chapter 5), and cerebral autosomal dominant arteriopathy 
with subcortical infarcts and leukoencephalopathy (CADASIL, 
a condition characterized by recurrent strokes and dementia 
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TABLE 12-6 Examples of Mendelian Subtypes of Complex Disorders 

MENDELIAN SUBTYPE 

PROTEiN (GENE) 

CONSEQUENCE OF MUTATION | 

Heart Disease 

Familial hypercholesterolemia 

LDL receptor (LDLR) 

Elevated LDL level 


Apolipoprotein B (APOB) 

Elevated LDL level 


Proprotein convertase subtilisin/kexin 

Gain-of-function mutations elevate LDL; 


type 9 (PCSK9) 

loss-of-function mutations reduce LDL 

Tangier disease 

ATP-binding cassette 1 (ABC1) 

Reduced HDL level 

Familial dilated cardiomyopathy 

Cardiac troponin T (TNNT2) 

Reduced force generation by sarcomere 


Titin (TTN) 

Destabilized sarcomere 


Cardiac (3-myosin heavy chain (MYH7) 

Reduced force generation by sarcomere 


(3-Sarcoglycan (SGCB) 

Destabilized sarcolemma and signal transduction 


5-Sarcoglycan (SGCDj 

Destabilized sarcolemma and signal transduction 


Dystrophin 

Destabilized sarcolemma in cardiac myocytes 

Familial hypertrophic 

Cardiac (3-myosin heavy chain (MYH7) 

Reduced force generation by sarcomere 

cardiomyopathy 

Cardiac troponin T (TNNT2) 

Reduced force generation by sarcomere 


Myosin-binding protein C (MYBPC) 

Sarcomere damage 

Long QT syndrome 

Cardiac potassium channel a subunit 

Prolonged QT interval on electrocardiogram, 


(LQT1, KCNQ1) 

arrhythmia 


Cardiac potassium channel a subunit 

Prolonged QT interval on electrocardiogram, 


(LQT2, KCNH2) 

arrhythmia 


Cardiac sodium channel (LQT3, SCN5A) 

Prolonged QT interval on electrocardiogram, 
arrhythmia 


Ankyrin B anchoring protein (LQT4, 

Prolonged QT interval on electrocardiogram. 


ANK2) 

arrhythmia 


Cardiac potassium channel (3 subunit 

Prolonged QT interval on electrocardiogram. 


(LQT5, KCNE1) 

arrhythmia 


Cardiac potassium channel subunit 

Prolonged QT interval on electrocardiogram. 


(LQT6, KCNE2) 

arrhythmia 

Hypertension 

Liddle syndrome 

Renal epithelial sodium channel 

Severe hypertension, low renin and suppressed 


subunits (SCNN1B, SCNN1G) 

aldosterone 

Gordon syndrome 

WNK1 or WNK4 kinase genes 

High serum potassium level and increased renal 
salt reabsorption 

Glucocorticoid-remediable 

Fusion of genes that encode 

Early-onset hypertension with suppressed 

aldosteronism 

aldosterone synthase and steroid 

plasma renin and normal or elevated 


11 (3-hydroxylase 

aldosterone levels 

Syndrome of apparent 

11 (3-Hydroxysteroid dehydrogenase 

Early-onset hypertension, low potassium and 

mineralocorticoid excess 

(np-HSD2) 

renin levels, low aldosterone 

Diabetes 

MODY1 

Flepatocyte nuclear factor-4a (HNE4A) 

Decreased insulin secretion 

MODY2 

Glucokinase (GCK) 

Impaired glucose metabolism, leading to mild 
nonprogressive hyperglycemia 

MODY3 

Hepatocyte nuclear factor-la (HNF1a) 

Decreased insulin secretion 

MODY4 

Insulin promoter factor-1 (IPF1) 

Decreased transcription of insulin gene 

MODY5 

Hepatocyte nuclear factor-1 (3 (HNFip) 

(3-cell dysfunction leads to decreased insulin 
secretion 

MODY6 

NeuroD transcription factor (NEURODl) 

Decreased insulin secretion 

Alzheimer Disease 

Familial Alzheimer disease 

Amyloid-(3 precursor protein (APP) 

Alteration of cleavage sites in amyloid-(3 
precursor protein, producing longer amyloid 
fragments 


Presenilin 1 (PSD 

Altered cleavage of amyloid-(3 precursor protein, 
producing larger proportion of long amyloid 
fragments 


Presenilin 2 (PS2) 

Altered cleavage of amyloid-(3 precursor protein, 
producing larger proportion of long amyloid 
fragments 


Continued 
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TABLE 12-6 Examples of Mendelian Subtypes of Complex Disorders^—cont'd 

MENDELIAN SUBTYPE 

PROTEIN (GENE) 

CONSEQUENCE OF MUTATION 

Parkinson Disease 

Familial Parkinson disease 
(autosomal dominant) 

Familial Parkinson disease 
(autosomal recessive) 

Familial Parkinson disease 
(autosomal dominant) 

a-Synuclein (PARK1, SNCA) 

Parkin; E3 ubiquityl ligase, thought to 
ubiquinate a-synuclein (PARK2) 

Ubiquitin C-hydrolase-LI (PARKS) 

Formation of a-synuclein aggregates 

Compromised degradation of a-synuclein 

Accumulation of a-synuclein 

Amyotrophic Lateral Sclerosis 

Familial amyotrophic lateral sclerosis 
Familial amyotrophic lateral sclerosis 
Juvenile amyotrophic lateral 
sclerosis (autosomal recessive) 

Superoxide dismutase 1 (SOD!) 
C90RF72 

Alsin (ALS2) 

Neurotoxic gain of function 

Expanded repeat mutation 

Presumed loss of function 

Epilepsy 

Benign neonatal epilepsy, types 1 
and 2 

Generalized epilepsy with febrile 
seizures plus type 1 

Autosomal dominant nocturnal 
frontal lobe epilepsy 

Generalized epilepsy with febrile 
seizures plus type 3 

Voltage-gated potassium channels 
(KCNQ2 and KCNQ3, respectively) 
Sodium channel (11 subunit (SCN1B) 

Neuronal nicotinic acetylcholine receptor 
subunits (CHRNA4 and CHRNB2) 
GABAa receptor (GABRG2) 

Reduced M current increases neuronal 
excitability 

Sodium current persistence leading to neuronal 
hyperexcitability 

Increased neuronal excitability in response to 
cholinergic stimulation 

Loss of synaptic inhibition leading to neuronal 
excitability 


HDL, High-density lipoprotein; LDL, low-density lipoprotein; MODY, maturity-onset diabetes of the young. 

*See Table 8-2 for genes involved in other diseases, including hearing loss and blindness. This table is not meant to provide an exhaustive 
list of genes; additional genes are discussed in the review papers cited at the end of Chapter 12. 


and caused by mutations in the NOTCH3 gene). Because 
blood clots are a common cause of stroke, it is expected that 
mutations in genes that encode coagulation factors might 
affect stroke susceptibility. For example, inherited deficiencies 
of protein C and protein S, both of which are coagulation 
inhibitors, are associated with an increased risk of stroke, 
especially in children. A specific mutation in clotting factor 
V, the factor V Leiden allele, causes resistance to activated 
protein C and thus produces an increased susceptibility 
to clotting. Heterozygosity for this allele, which is seen in 
approximately 5% of whites, produces a sevenfold increase in 
the risk of venous thrombosis (clots). In homozygotes, the 
risk increases to 100-fold. However, the evidence for an asso¬ 
ciation between the factor V Leiden allele and stroke is 
inconsistent. 

In addition to family history and specific genes, several 
factors are known to increase the risk of stroke. These include 
hypertension, obesity, atherosclerosis, diabetes, and smoking. 

Stroke, which clusters in families, is associated 
with several single-gene disorders and with 
some inherited coagulation disorders. 

Hypertension. Systemic hypertension is a key risk factor 
for heart disease, stroke, and kidney disease. It is estimated 
that hypertension is responsible for approximately half 
of all cardiovascular mortality. Studies of blood pressure 


correlations within families yield heritability estimates of 
approximately 30% to 50% for both systolic and diastolic 
blood pressure. Heritability estimates based on twin studies 
tend to be higher (about 60%) and may be inflated because 
of greater similarities in the environments of MZ compared 
with DZ twins. The fact that the heritability estimates are 
substantially less than 100% indicates that environmental 
factors must also be significant causes of blood pressure vari¬ 
ation. The most important environmental risk factors for 
hypertension are increased sodium intake, decreased physical 
activity, psychosocial stress, and obesity (as discussed later, 
the latter factor is itself influenced both by genes and 
environment). 

Blood pressure regulation is a highly complex process that 
is influenced by many physiological systems, including 
various aspects of kidney function, cellular ion transport, 
vascular tone, and heart function. Because of this complexity, 
much research is now focused on specific components that 
might influence blood pressure variation, such as the renin- 
angiotensin system (Fig. 12-8) (involved in sodium reabsorp¬ 
tion and vasoconstriction); vasodilators such as nitric oxide 
and the kallikrein-kinin system; and ion transport systems 
such as adducin and sodium-lithium countertransport. 
These individual factors are more likely to be under the 
control of smaller numbers of genes than is blood pressure 
itself, simplifying the task of identifying these genes and their 
role in regulating blood pressure. For example, linkage and 
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Stimulates release 
of aldosterone 
(sodium retention) 
and 

vasoconstriction 

i 

t Blood pressure 

Feedback to kidneys 
-I Renin secretion 


FIG 12-8 The renin-angiotensin-aldoste- 
rone system. T, Increased; -i-, decreased; 
ATi, angiotensin type II receptor 1. (Modi¬ 
fied from King RA, Rotter Jl, Motulsky 
AG, eds. The Genetic Basis of Common 
Diseases. New York: Oxford University 
Press; 1992.) 


association studies have implicated several genes involved in 
the renin-angiotensin system (e.g., the genes that encode 
angiotensinogen, angiotensin-converting enzyme type 1, and 
angiotensin 11 type 1 receptor) in causing hypertension. 

A small percentage of hypertension cases are the result of 
rare single-gene disorders, such as Liddle syndrome (low 
plasma aldosterone and hypertension caused by mutations 
that alter the ENaC epithelial sodium channel) and Gordon 
syndrome (hypertension, high serum potassium level, and 
increased renal salt reabsorption caused by mutations in the 
WNKl or WNK4 kinase genes); see Table 12-6 for additional 
examples. More than 20 genes have been identified that can 
lead to rare inherited forms of hypertension, and many of 
them affect the reabsorption of water and salt by the kidney, 
which in turn affects blood volume and blood pressure. It is 
hoped that isolation and study of these genes, and the path¬ 
ways in which they act, will lead to the identification of 
genetic factors underlying essential hypertension.* 

Large-scale genome scans, undertaken in more than 
100,000 humans and in experimental animals such as mice 
and rats, have sought to identify additional loci (see Box 12-1 ) 
that might underlie essential hypertension. These studies 
have identified dozens of genes that influence susceptibility 


*The term “essential” refers to the 95% of hypertension cases that 
are not caused by a known mutation or syndrome. 


to hypertension. Although the individual effects of each of 
these genes is small (as expected for a multifactorial trait), 
they collectively explain a significant proportion of the 
genetic risk for hypertension. 

Heritability estimates for systolic and diastolic 
blood pressure range from 30% to 50%. A 
number of genes responsible for rare hyper¬ 
tension syndromes have been identified, and 
genome scans have implicated regions that 
might contain genes that underlie susceptibil¬ 
ity to essential hypertension. Other risk factors 
for hypertension include increased sodium 
intake, lack of exercise, psychosocial stress, 
and obesity. 

Cancer 

Cancer is the second leading cause of death in the United 
States, although it is estimated that it might soon surpass 
heart disease as the leading cause of death. It is well estab¬ 
lished that many major types of cancer (e.g., breast, colon, 
prostate, ovarian) cluster strongly in families. This is due both 
to shared genes and shared environmental factors. Although 
numerous cancer genes have been isolated, environmental 
factors also play an important role in causing cancer by 
inducing somatic mutations. In particular, tobacco use is 
estimated to account for one third of all cancer cases in 
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developed countries, making it the most important known 
cause of cancer. Diet (i.e., carcinogenic substances and the 
lack of “anticancer” components such as fiber, fruits, and 
vegetables) is another leading cause of cancer and may also 
account for as much as one third of cancer cases. It is esti¬ 
mated that approjdmately 15% of worldwide cancer cases are 
caused primarily by infectious agents (e.g., human papilloma 
virus for cervical cancer, hepatitis B and C for liver cancer). 
Because cancer genetics was the subject of Chapter 11, we 
confine our attention here to genetic and environmental 
factors that influence susceptibility to some of the most 
common cancers. 

Breast cancer. Breast cancer is the second most com¬ 
monly diagnosed cancer (after skin cancer) among women, 
affecting approximately 12% of American women who live to 
age 85 years or older. Invasive breast cancer was diagnosed in 
approximately 230,000 American women in 2014. Although 
the breast cancer mortality rate has been declining for 25 
years, about 40,000 women in the United States die from this 
disease each year. Breast cancer was formerly the leading 
cause of cancer death among women, but it has been sur¬ 
passed by lung cancer. Breast cancer can also occur in men, 
with a lifetime prevalence that is roughly 100 times lower 
than that of women (2400 cases diagnosed in 2014). The 
familial aggregation of breast cancer has been recognized for 
centuries, having been described by physicians in ancient 
Rome. If a woman has one affected first-degree relative, her 
risk of developing breast cancer doubles. The risk increases 
further with additional affected relatives, and it increases if 
those relatives developed cancer at a relatively early age 
(before 50 years of age). 

Several genes are now known to predispose women to 
developing hereditary breast cancer. Most important among 
these are BRCAl and BRCA2, two genes involved in DNA 
repair (see Chapter 11). Germline mutations in the TP53 and 
CHK2 genes can cause Li-Fraumeni syndrome, which also 
predisposes to breast cancer. Cowden disease, a rare autoso¬ 
mal dominant condition that includes multiple hamartomas 
and breast cancer, is caused by mutations in the PTEN tumor 
suppressor gene (see Chapter 11). Ataxia-telangiectasia, an 
autosomal recessive disorder caused by defective DNA repair, 
includes breast cancer in its presentation. Mutations in the 
MSH2 and MLHl DNA repair genes, which lead to hereditary 
nonpolyposis colorectal cancer (HNPCC), also confer an 
increased risk of breast cancer. Despite the significance of 
these genes, it should be emphasized that more than 90% of 
breast cancer cases are not inherited as mendelian diseases. 

A number of environmental factors are known to increase 
the risk of developing breast cancer. These include nulliparity 
(never bearing children), bearing the first child after 30 years 
of age, a high-fat diet, alcohol use, and estrogen replacement 
therapy. 

Colorectal cancer. It is estimated that 1 in 20 Americans 
will develop colorectal cancer, and roughly one third of those 
with this cancer will die from it. With approximately 135,000 
new cases and 50,000 deaths in the United States each year, 
colorectal cancer is second only to lung cancer in the total 
number of annual cancer deaths. Like breast cancer, it clusters 


in families; familial clustering of this form of cancer was 
reported in the medical literature as early as 1881. The risk 
of colorectal cancer in people with one affected first-degree 
relative is two to three times higher than that of the general 
population. 

As discussed in Chapter 11 , familial colorectal cancer can 
be the result of mutations in the APC tumor suppressor gene 
or in one of several DNA mismatch-repair genes (HNPCC). 
Another, less common, inherited cause of colorectal cancer is 
the autosomal dominant Peutz-Jeghers syndrome. About half 
of Peutz-Jeghers cases are caused by mutations in the STKl 1 
tumor suppressor gene, which encodes a protein kinase. Juve¬ 
nile intestinal polyposis, an autosomal dominant disease 
defined by the presence of 10 or more polyps before adult¬ 
hood, can be caused by mutations in SMAD4 (see Chapter 
11), in BMPRAl (a receptor serine-threonine kinase gene), 
or, in rare cases, in PTEN. 

As with breast cancer, most colorectal cancer cases (>90%) 
are not inherited as mendelian conditions and are likely to be 
caused by a complex interaction of inherited and somatic 
genetic alterations and environmental factors. The latter risk 
factors include a lack of physical activity and a high-fat, low- 
fiber diet. 

Prostate cancer. Prostate cancer is the second most com¬ 
monly diagnosed cancer in men (after skin cancer), with 
approximately 220,000 new cases annually in the United 
States. Prostate cancer is second only to lung cancer as a cause 
of cancer death in men, causing more than 27,000 deaths each 
year. Having an affected first-degree relative increases the risk 
of developing prostate cancer by a factor of two to three, and 
the heritability of prostate cancer is estimated to be approxi¬ 
mately 40%. 

The relatively late age of onset of most prostate cancer 
cases (median age, 72 years) makes genetic analysis especially 
difficult. However, loss of heterozygosity (see Chapter 11) has 
been observed in a number of genomic regions in prostate 
tumor cells, possibly indicating the presence of genetic altera¬ 
tions in these regions. In addition, genome scans (GWAS) 
have identified several dozen polymorphisms associated with 
prostate cancer risk. Several of these are located in chromo¬ 
some 8q24, which contains polymorphisms associated with 
several other cancers as well (colon, pancreas, and esopha¬ 
gus). Although the 8q24 region contains no protein-coding 
genes, it contains enhancer elements that affect expression of 
the MYC oncogene, located about 250 kb from 8q24. 

Nongenetic risk factors for prostate cancer may include a 
high-fat diet. Because prostate cancer usually progresses 
slowly and because it can be detected by digital examination 
and by the prostate-specific antigen (PSA) test, fatal metasta¬ 
sis can usually be prevented. 

Most common cancers have genetic compo¬ 
nents. Recurrence risks tend to be higher if 
there are several affected relatives and if those 
relatives developed cancer at an early age. 
Specific genes have been discovered that 
cause inherited colon, breast, and prostate 
cancer in some families. 
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Diabetes Mellitus 

Like the other disorders discussed in this chapter, the etiology 
of diabetes mellitus is complex and not fully understood. 
Nevertheless, progress is being made in understanding the 
genetic basis of this disorder, which is the leading cause of 
adult blindness, kidney failure, and lower-limb amputation 
and a major cause of heart disease and stroke. An important 
advance has been the recognition that diabetes mellitus is 
actually a heterogeneous group of disorders, all characterized 
by elevated blood sugar. We focus here on the three major 
types of diabetes, type 1 (formerly termed insulin-dependent 
diabetes mellitus, or IDDM), type 2 (formerly termed non- 
insulin-dependent diabetes mellitus, or NIDDM), and 
maturity-onset diabetes of the young (MODY). 

Type 1 diabetes. Type 1 diabetes, which is characterized 
by T-cell infiltration of the pancreas and destruction of the 
insulin-producing beta cells, usually (though not always) 
manifests before 40 years of age. Patients with type 1 diabetes 
must receive exogenous insulin to survive. In addition to 
T-cell infiltration of the pancreas, autoantibodies are formed 
against pancreatic cells, insulin, and enzymes such as glu¬ 
tamic acid decarboxylase; these autoantibodies can be 
observed long before clinical symptoms occur. These find¬ 
ings, along with a strong association between type 1 diabetes 
and the presence of several human leukocyte antigen (HLA) 
class II alleles, indicate that this is an autoimmune disease. 
Over the past few decades, the incidence of type 1 diabetes 
has increased substantially. 

Siblings of persons with type 1 diabetes face a substantial 
elevation in risk: approximately 6%, as opposed to a risk of 
about 0.3% to 0.5% in the general population. The recur¬ 
rence risk is also elevated when there is a diabetic parent, 
although this risk varies with the sex of the affected parent. 
The risk for offspring of diabetic mothers is only 1% to 3%, 
but it is 4% to 6% for the offspring of diabetic fathers. 
(Because type 1 diabetes affects males and females in roughly 
equal proportions in the general population, this risk differ¬ 
ence is inconsistent with the sex-specific threshold model for 
multifactorial traits.) Twin studies show that the empirical 
risk for MZ twins of type 1 diabetes patients ranges from 30% 
to 50%. In contrast, the concordance rate for DZ twins is 5% 
to 10%. The fact that type 1 diabetes is not 100% concordant 
among identical twins indicates that genetic factors are not 
solely responsible for the disorder. There is evidence that 
specific viral infections contribute to the cause of type 1 
diabetes in at least some persons, possibly by activating an 
autoimmune response. 

The association of specific HLA class II alleles and type 1 
diabetes has been studied extensively, and it is estimated that 
the HLA loci account for about 40% to 50% of the genetic 
susceptibility to type 1 diabetes. Approximately 95% of whites 
with type 1 diabetes have the HLA DR3 and/or DR4 alleles, 
whereas only about 50% of the general white population has 
either of these alleles. If an affected proband and a sibling are 
both heterozygous for the DR3 and DR4 alleles, the sibling’s 
risk of developing type 1 diabetes is nearly 20% (i.e., about 
40 times higher than the risk in the general population). This 


association may in part reflect linkage disequilibrium between 
alleles of the DR locus and those of the HLA-DQ locus. The 
absence of aspartic acid at position 57 of the DQ polypeptide 
is strongly associated with susceptibility to type 1 diabetes; in 
fact, those who do not have this amino acid at position 57 
(and instead are homozygous for a different amino acid) are 
100 times more likely to develop type 1 diabetes. The aspartic 
acid substitution alters the shape of the HLA class II molecule 
and thus its ability to bind and present peptides to T cells (see 
Chapter 9). Altered T-cell recognition might help to protect 
persons with the aspartic acid substitution from an autoim¬ 
mune episode. 

The insulin gene, which is located on the short arm of 
chromosome 11, is another logical candidate for type 1 dia¬ 
betes susceptibility. Polymorphisms within and near this gene 
have been tested for association with type 1 diabetes. Intrigu- 
ingly, a strong risk association is seen with allelic variation in 
a VNTR polymorphism (see Chapter 3) located just 5' of the 
insulin gene. Differences in the number of VNTR repeat units 
might affect transcription of the insulin gene (possibly by 
altering chromatin structure), which would result in varia¬ 
tion in susceptibility. It is estimated that inherited genetic 
variation in the insulin region accounts for approximately 
10% of the familial clustering of type 1 diabetes. 

Affected sib-pair and genome-wide association analyses 
have been used extensively to map additional genes that can 
cause type 1 diabetes. In addition, an animal model, the non- 
obese diabetic (NOD) mouse, has been used to identify dia¬ 
betes susceptibility genes that could have similar roles in 
humans (see Box 12-1). These studies have identified dozens 
of additional genes associated with type 1 diabetes suscepti¬ 
bility. One of these is the CTLA4 (cytotoxic lymphocyte 
associated-4) gene, which encodes an inhibitory T-cell recep¬ 
tor. Several studies have demonstrated that alleles of CTLA4 
are associated with an increased risk of type 1 diabetes. There 
is growing evidence that variation in CTLAA is also associ¬ 
ated with other autoimmune diseases, such as rheumatoid 
arthritis and celiac disease. Another gene associated with 
type 1 diabetes susceptibility, PTPN22, is involved in T-cell 
regulation and is also associated with other autoimmune dis¬ 
orders, including rheumatoid arthritis and systemic lupus 
erythematosus. 

Type 2 diabetes. Type 2 diabetes accounts for more than 
90% of all diabetes cases, and its incidence is rising rapidly 
in populations with access to high-calorie diets. It currently 
affects approximately 10% to 20% of the adult populations 
of many developed countries. One study estimates that 
because of the rapid rate of increase of this disease, one third 
of Americans born in 2000 will eventually develop type 2 
diabetes. 

A number of features distinguish type 2 diabetes from type 
1 diabetes. Persons with type 2 diabetes usually have some 
degree of endogenous insulin production, at least in earlier 
stages of the disease, and they can sometimes be treated suc¬ 
cessfully with dietary modification, oral drugs, or both. In 
contrast to those with type 1 diabetes, patients with type 2 
diabetes have insulin resistance (i.e., their cells have difficulty 
using insulin) and are more likely to be obese. This form of 





258 


CHAPTER 12 Multifactorial Inheritance and Common Diseases 


TABLE 12-7 Comparison of the 

Major Features of Type 1 and Type 2 
Diabetes Mellitus 


TYPE 1 

TYPE 2 

FEATURE 

DIABETES 

DIABETES 

Age of onset 

Usually <40 yr 

Usually >40 yr 

Insulin production 

None 

Partial 

Insulin resistance 

No 

Yes 

Autoimmunity 

Yes 

No 

Obesity 

Not common 

Common 

MZ twin concordance 

0.35-0.50 

0.90 

Sibling recurrence risk 

1-6% 

15-40% 


MZ, Monozygotic. 


diabetes has traditionally been seen primarily in patients 
older than 40 years, but because of increasing obesity among 
adolescents and young adults, it is now increasing rapidly in 
this segment of the population. Neither HLA associations nor 
autoantibodies are seen commonly in this form of diabetes. 
MZ twin concordance rates are substantially higher than in 
type 1 diabetes, often exceeding 90% (because of age depen¬ 
dence, the concordance rate increases if older subjects are 
studied). The empirical recurrence risks for first-degree rela¬ 
tives of patients with type 2 diabetes are higher than those 
for type 1 patients, generally ranging from 15% to 40%. The 
differences between type 1 and type 2 diabetes are summa¬ 
rized in Table 12-7. 

The two most important risk factors for type 2 diabetes 
are a positive family history and obesity; the latter increases 
insulin resistance. The disease tends to rise in prevalence 
when populations adopt a diet and exercise pattern typical of 
United States and European populations. Increases have been 
seen, for example, among Japanese immigrants to the United 
States and among some native populations of the South 
Pacific, Australia, and the Americas. Several studies, con¬ 
ducted on both male and female subjects, have shown that 
regular exercise can substantially lower one’s risk of develop¬ 
ing type 2 diabetes, even among persons with a family history 
of the disease. This is partly because exercise reduces obesity. 
However, even in the absence of weight loss, exercise increases 
insulin sensitivity and improves glucose tolerance. 

Extensive linkage and genome-wide association analyses 
have identified more than 70 genes that contribute to type 2 
diabetes susceptibility. The most significant gene identified 
thus far is TCF7L2, which encodes a transcription factor 
involved in secreting insulin. A variant of TCF7L2 is associ¬ 
ated with a 50% increased risk of developing type 2 diabetes. 
A significant association has also been observed between type 
2 diabetes and a common allele of the gene that encodes 
peroxisome proliferator-activated receptor-y (PPAR-y), a 
nuclear receptor that is involved in adipocyte differentiation 
and glucose metabolism. This receptor is the target of thia- 
zolidinediones [TZDs], a class of drugs commonly used to 
increase insulin sensitivity in those with type 2 diabetes. 
Although the common PPARG allele confers only a 25% 


increase in the risk of developing type 2 diabetes, it is found 
in more than 75% of persons of European descent and thus 
helps to account for a significant fraction of type 2 diabetes 
cases. Variation in KCNJll, which encodes a potassium 
channel necessary for glucose-stimulated insulin secretion, 
confers an additional 20% increase in type 2 diabetes suscep¬ 
tibility. The associations between diabetes susceptibility and 
each of these genes have been widely replicated in numerous 
populations. Many rare susceptibility alleles in each of these 
diabetes-associated genes are now being discovered through 
high-throughput DNA sequencing studies. 

Maturity-onset diabetes of the young. MODY, which 
accounts for 1% to 5% of all diabetes cases, typically occurs 
before 25 years of age and follows an autosomal dominant 
mode of inheritance. In contrast to type 2 diabetes, it is not 
associated with obesity. Studies of MODY pedigrees have 
shown that about 50% of cases are caused by mutations in 
the gene that encodes glucokinase, a rate-limiting enzyme in 
the conversion of glucose to glucose-6-phosphate in the pan¬ 
creas. Another 40% of MODY cases are caused by mutations 
in any of five genes that encode transcription factors involved 
in pancreatic development or insulin regulation: hepatocyte 
nuclear factor-la (HNFla), hepatocyte nuclear factor-ljj 
(HNFlp), hepatocyte nuclear factor-4a (HNP4a), insulin 
promoter factor-1 (IPFl), and neurogenic differentiation 1 
(NEURODl). Mutations in these genes, all of which are 
expressed in pancreatic beta cells, lead to beta cell abnormali¬ 
ties and thus to diabetes. 

Type 1 (insulin-dependent) and type 2 (non- 
insulin-dependent) diabetes both cluster in 
families, with stronger familial clustering 
observed for type 2 diabetes. Type 1 has an 
earlier average age of onset, is HLA-associated, 
and is an autoimmune disease. Type 2 is not 
an autoimmune disorder and is more likely to 
be seen in obese persons. Several genes have 
been identified that increase susceptibility to 
type 1 or type 2 diabetes. Most cases of auto¬ 
somal dominant MODY are caused by muta¬ 
tions in any of six specific genes. 

Obesity 

The worldwide prevalence of obesity is increasing rapidly 
among adults and children. Approximately 70% of American 
adults and 60% of British adults are overweight (body mass 
index [BMI] >25),^ and about half of these overweight 
persons are obese (BMI >30). Although obesity itself is not a 
disease, it is an important risk factor for several common 
diseases, including heart disease, stroke, type 2 diabetes, and 
cancers of the prostate, breast, and colon. 

As one might expect, there is a strong correlation between 
obesity in parents and obesity in their children. This could 


^BMI is defined as W/H^, where W is weight in kilograms and H is 
height in meters. 
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easily be ascribed to common environmental effects: parents 
and children usually share similar diet and exercise habits. 
However, there is good evidence for genetic components as 
weU. Four adoption studies each showed that the body weights 
of adopted persons correlated significantly with their natural 
parents’ body weights but not with those of their adoptive 
parents. Twin studies also provide evidence for a genetic 
effect on body weight, with most studies yielding heritability 
estimates between 0.60 and 0.80. The heritability of “fatness” 
(measured, for example, by skinfold thickness) is approxi¬ 
mately 0.50. 

Research, aided substantially by mouse models, has shown 
that several genes likely play a role in human obesity. Impor¬ 
tant among these are the genes that encode leptin (Greek, 
“thin”) and its receptor. The leptin hormone is secreted by 
adipocytes (fat storage cells) and binds to receptors in the 
hypothalamus, the site of the body’s appetite control center. 
Increased fat stores lead to an elevated leptin level, which 
produces satiety and a loss of appetite. Lower leptin levels 
lead to increased appetite. Mice with loss-of-function muta¬ 
tions in the leptin gene have uncontrolled appetites and 
become obese. When injected with leptin, these mice lose 
weight. Mice with mutations in the leptin receptor gene 
cannot respond to increased leptin levels and also develop 
obesity. 

Identification of the leptin gene and its receptor in mice 
led to their identification in humans, which in turn prompted 
optimistic predictions that leptin could be a key to weight loss 
in humans (without the perceived unpleasantness of dieting 
and exercise). However, most obese humans have high levels 
of leptin, indicating that the leptin gene is functioning nor¬ 
mally. Leptin receptor defects were then suspected, but these 
are also uncommon in humans. Although mutations in the 
human leptin gene and its receptor have now been identified 
in a few humans with severe obesity (BMI >40), they both 
appear to be extremely rare. Unfortunately, studying these 
genes wiU not solve the problem of human obesity. However, 
clinical trials using recombinant leptin have demonstrated 
moderate weight loss in a subset of obese individuals. 

In addition, leptin participates in important interactions 
with other components of appetite control, such as neuro¬ 
peptide Y, as well as a-melanocyte-stimulating hormone 
and its receptor, the melanocortin-4 receptor (MC4R). 
Mutations in the gene that encodes MC4R have been found 
in 3% to 5% of severely obese individuals. Several genome¬ 
wide association studies have demonstrated an association 
between an intronic variant in the brain-expressed PTO gene 
and obesity in whites. Homozygosity for this variant, which 
is seen in about 16% of whites, confers increased risks of 
overweight and obesity of 40% and 70%, respectively. Recent 
evidence shows that the FTO variant is part of an enhancer 
that binds to the IRX gene, which is located 2 Mb away from 
PTO and is involved in regulation of fat mass. Identification 
of these and other obesity-predisposing genes is leading to a 
better understanding of appetite control in humans and 
could eventually lead to effective treatments for some cases 
of obesity. 


Adoption and twin studies indicate that at 
least half of the population variation in obesity 
may be caused by genes. Specific genes and 
gene products involved in appetite control and 
susceptibility to obesity, including leptin and 
its receptor, MC4R, and FTO, are now being 
studied. 

Alzheimer Disease 

Alzheimer disease (AD), which is responsible for 60% to 70% 
of cases of progressive cognitive impairment among the 
elderly, affects approximately 10% of the population older 
than 65 years and 40% of the population older than 85 years. 
Because of the aging of the population, the number of Ameri¬ 
cans with AD, which is currently about 5 million, continues 
to increase. Alzheimer disease is characterized by progressive 
dementia and memory loss and by the formation of amyloid 
plaques and neurofibrillary tangles in the brain, particularly 
in the cerebral cortex and hippocampus. The plaques and 
tangles lead to progressive neuronal loss, and death usually 
occurs within 7 to 10 years after the first appearance of 
symptoms. 

The risk of developing AD doubles in persons who have 
an affected first-degree relative. Although most cases do not 
appear to be caused by single genes, approximately 10% 
follow an autosomal dominant mode of transmission. About 
3% to 5% of AD cases occur before age 65 years and are 
considered early onset; these are much more likely to be 
inherited in autosomal dominant fashion. 

Alzheimer disease is a genetically heterogeneous disorder. 
Approximately half of early-onset cases can be attributed to 
mutations in any of three genes, all of which affect amyloid-(3 
deposition. Two of the genes, presenilin 1 (PSl) and preseni- 
lin 2 (PS2), are very similar to each other, and their protein 
products are involved in cleavage of the amyloid-(3 precursor 
protein (ARP) by y-secretase (posttranslational modification; 
see Chapter 2). Gain-of-function mutations in PSl or PS2 
affect the cleavage of APP such that amyloid-producing forms 
of it accumulate excessively and are deposited in the brain 
(Fig. 12-9). This is thought to be a primary cause of AD. 
Mutations in PSl typically result in especially early onset of 
AD, with the first occurrence of symptoms in the fifth decade 
of life. 

A small percentage of cases of early-onset AD are caused 
by mutations of the gene (APP) that encodes APP itself, 
which is located on chromosome 21. These mutations disrupt 
normal secretase cleavage sites in APP (see Fig. 12-9), again 
leading to the accumulation of the longer protein product. It 
is interesting that this gene is present in three copies in 
persons with trisomy 21, where the extra gene copy leads to 
amyloid deposition and the frequent occurrence of early- 
onset AD in Down syndrome patients (see Chapter 6). High- 
throughput DNA sequencing studies have revealed an allele 
in APP that is protective against Alzheimer disease and may 
help to prevent cognitive decline. 

An important risk factor for the more common late-onset 
form of AD is allelic variation in the apolipoprotein E (APOP) 
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FIG 12-9 A, Cleavage of the amyloid-p precursor protein (APP) by a-secretase disrupts the 
amyloid-P protein and prevents the formation of amyloid plaques. B, An alternate cleavage 
pathway involves cleavage of APP by p-secretase at the N terminus and ysecretase at the C 
terminus, producing a protein product of 40 to 42 amino acids. Gain-of-function mutations in the 
presenilin genes result in increased cleavage activity via this pathway. This yields an excessive 
amount of the 42-amino acid form of APP, which leads to formation of amyloid plaques. Muta¬ 
tions in the APP gene can also alter the a-secretase cleavage sites such that excess amounts 
of the longer form of APP are produced. 


locus, which has three major alleles: el, £3, and £4. Studies 
conducted in diverse populations have shown that persons 
who have one copy of the £4 allele are 2 to 5 times more likely 
to develop AD, and those with two copies of this allele are 5 
to 10 times more likely to develop AD. The risk varies some¬ 
what by population, with higher £4-associated risks in whites 
and Japanese and relatively lower risks in Latin Americans 
and African Americans. Despite the strong association 
between £4 and AD, approximately half of persons who 
develop late-onset AD do not have a copy of the £4 allele, and 
many who are homozygous for £4 remain free of AD even at 
advanced age. The apolipoprotein E protein product is not 
involved in cleavage of APP but instead appears to be associ¬ 
ated with clearance of amyloid from the brain. 

Genome-wide association studies, based on microarrays 
and high-throughput sequencing, indicate that there are 
many additional genes associated with late-onset AD. Disease- 
associated variants that are relatively common in populations 
tend to confer only a modest increase in risk, as is the case 
with most multifactorial conditions. Less common disease- 
associated variants, revealed by high-throughput sequencing, 
confer a higher relative risk of AD but are generally rare in 
populations. 

AD has several features that have made it somewhat refrac¬ 
tory to genetic analysis. Its genetic heterogeneity has already 
been described. In addition, because a definitive diagnosis 
can be obtained only by a brain autopsy, it is often difficult 
to diagnose AD in living family members (although clinical 
features and brain imaging studies can provide strong evi¬ 
dence that a person is affected with AD). Finally, because 
onset of the disease can occur very late in life, persons carry¬ 
ing an AD-predisposing mutation could die from another 
cause before developing the disease. They would then be mis- 
identified as noncarriers. These types of difficulties arise not 


only with AD but with many other common adult diseases as 
well. Despite these obstacles, multiple AD-associated genes 
have now been identified, leading to a better understanding 
of the disorder and to the possibility of more effective AD 
treatment. 

Approximately 10% of AD cases are caused by 
autosomal dominant genes. Early-onset cases 
cluster more strongly in families and are more 
likely to follow an autosomal dominant inheri¬ 
tance pattern. This disease is genetically het¬ 
erogeneous, and many AD susceptibility genes 
have been identified. Three genes (encoding 
presenilin 1, presenilin 2, and amyloid-p pre¬ 
cursor protein) cause early-onset AD and affect 
the cleavage and processing of the amyloid 
precursor protein. The most significant gene 
associated with late-age onset AD encodes the 
apolipoprotein E protein. 

Alcoholism 

At some point in their lives, alcoholism (alcohol dependence) 
is diagnosed in approximately 10% of men and 3% to 5% of 
women in the United States. More than 100 studies have 
shown that this disease clusters in families: the risk of devel¬ 
oping alcoholism among persons with one affected parent is 
three to five times higher than for those with unaffected 
parents. Most twin studies have yielded concordance rates for 
DZ twins of less than 30% and for MZ twins in excess of 
60%, with an estimated heritability of approximately 50%. 
Adoption studies have shown that the offspring of an alco¬ 
holic parent, even when raised by nonalcoholic parents, have 
a fourfold increased risk of developing the disorder. To 
control for possible prenatal effects from an alcoholic mother. 
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some studies have included the offspring of alcoholic fathers 
only. The results have remained the same. These data argue 
that there may be genes that predispose some people to 
alcoholism. 

Alcohol dependence can be divided into different sub- 
types, and some researchers distinguish two major types of 
alcoholism. Type 1 is characterized by a later age of onset 
(after 25 years of age), occurrence in both males and females, 
and greater psychological dependency on alcohol. Type I 
alcoholics are more likely to be introverted, solitary drinkers. 
This form of alcoholism is less likely to cluster in families 
(one study yielded a heritability estimate of 0.21), has a less- 
severe course, and is more easily treated. Type 11 alcoholism 
is seen predominantly in males, typically occurs before 25 
years of age, and tends to involve persons who are extroverted 
and thrill-seeking. This form is more difficult to treat success¬ 
fully and tends to cluster more strongly in families, with 
heritability estimates ranging from 0.55 to more than 0.80. 

It has long been known that an individual’s physiological 
response to alcohol can be influenced by variation in the key 
enzymes responsible for alcohol metabolism: alcohol dehy¬ 
drogenase (ADH), which converts ethanol to acetaldehyde, 
and aldehyde dehydrogenase (ALDH), which converts acetal¬ 
dehyde to acetate. In particular, an allele of the ALDH2 gene 
(ALDH2*2) results in excessive accumulation of acetaldehyde 
and thus in facial flushing, nausea, palpitations, and light¬ 
headedness. Because of these unpleasant effects, persons who 
have the ALDH2*2 allele are much less likely to become alco¬ 
holics. This protective allele is common in some East Asian 
populations, with a frequency of approximately 40%, but is 
rare in most other populations. 

A number of genome scans have been undertaken in large 
cohorts of alcoholics and controls. One of the most consistent 
findings is that variants in genes that encode components of 
gamma-aminobutyric acid (GABA) receptors are associated 
with addiction to alcohol. This finding is biologically plau¬ 
sible, because the GABA neurotransmitter system inhibits 
excitatory signals in neurons, exerting a calming effect. 
Alcohol has been shown to increase GABA release, and allelic 
variation in GABA receptor genes might modulate this effect. 

It should be underscored that we refer to genes that might 
increase one’s susceptibility to alcoholism. This is obviously a 
disease that requires an environmental component, regard¬ 
less of one’s genetic constitution. 

Twin and adoption studies show that alcohol¬ 
ism clusters quite strongly in families, reflect¬ 
ing a genetic contribution to this disease. 
Familial clustering is particularly strong for 
type II alcoholism (early-onset form primarily 
affecting males). 

Psychiatric Disorders 

The major psychiatric diseases, including schizophrenia, 
bipolar disorder, and autism spectrum disorder, have been 
the subjects of numerous genetic studies. Twin, adoption, and 
family studies have shown that these disorders aggregate in 


families. Moreover, schizophrenia, biopolar disorder, and 
autism tend to occur together in the same families. In part, 
this reflects the fact that some of the same genes influence 
susceptibility to each of these conditions. 

Schizophrenia. Schizophrenia is a severe emotional dis¬ 
order characterized by delusions, hallucinations, retreat from 
reality, and bizarre, withdrawn, or inappropriate behavior 
(contrary to popular belief, schizophrenia is not a “split per¬ 
sonality” disorder). The lifetime recurrence risk for schizo¬ 
phrenia among the offspring of one affected parent is 
approximately 8% to 10%, which is about 10 times higher 
than the risk in the general population. The empirical risks 
increase when more relatives are affected. For example, a 
person with an affected sibling and an affected parent has a 
risk of about 15% to 20%, and a person with two affected 
parents has a risk of 40% to 50%. The risks decrease when 
the affected family member is a second- or third-degree rela¬ 
tive. Details are given in Table 12-8. On inspection of this 
table, it may seem puzzling that the proportion of schizo¬ 
phrenic probands who have a schizophrenic parent is only 
about 5%, which is substantially lower than the risk for other 
first-degree relatives (e.g., siblings, affected parents, and their 
offspring). This can be explained by the fact that schizo¬ 
phrenics are less likely to marry and produce children than 
are other persons. There is thus substantial selection against 
schizophrenia in the population. 

Twin and adoption studies indicate that genetic factors are 
likely to be involved in schizophrenia. Data pooled from mul¬ 
tiple twin studies show a 47% concordance rate for MZ twins, 
compared with only 12% for DZ twins. The concordance rate 
for MZ twins reared apart, 46%, is about the same as the rate 
for MZ twins reared together. The risk of developing the 
disease for offspring of a schizophrenic parent who are 
adopted by normal parents is about 10%, approximately the 
same as the risk when the offspring are raised by a schizo¬ 
phrenic biological parent. 

Dozens of genome scans have been performed in an 
effort to locate schizophrenia genes. The techniques discussed 
in Ghapter 8 (GWAS, exome sequencing) have identified 


TABLE 12-8 Recurrence Risks for 
Relatives of Schizophrenic Probands, 
Based on Multiple Studies of Western 
European Populations 



Adapted from McGue M, Gottesman II, Rao DC. The analysis of 
schizophrenia family data. Behav Genet. 1986;16:75-87. 
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significant associations between schizophrenia and polymor¬ 
phisms in more than 100 genomic regions. A number of the 
genes in these regions encode components of the glutamater- 
gic and dopaminergic neuronal signaling pathways. These 
findings are biologically plausible because the major drugs 
used to treat schizophrenia block dopamine receptors. 

Bipolar disorder. Bipolar disorder, also known as manic- 
depressive disorder, is characterized by extreme mood swings 
and emotional instability. The prevalence of the disorder in 
the general population is approximately 0.5% to 1%, but it 
rises to 5% to 10% among those with an affected first-degree 
relative. A study based on the Danish twin registry yielded 
concordance rates of 79% and 24% for MZ and DZ twins, 
respectively, yielding a heritabUity estimate of approximately 
72% (other studies have yielded even higher heritability 
estimates—up to 90%). The corresponding concordance 
rates for major depressive disorder (sometimes termed uni¬ 
polar depression) were 54% and 19%. Thus, it appears that 
bipolar disorder is more strongly influenced by genetic factors 
than is major depressive disorder. 

As with schizophrenia, many linkage and genome-wide 
association studies have been carried out to identify genes 
contributing to bipolar disorder. These studies have impli¬ 
cated many chromosome regions in multiple population 
samples. Some of the best-replicated results are for genes 
that encode voltage-gated calcium channels. Ion-channel 
modulating drugs are frequently used as mood stabilizers, 
giving these results functional plausibility. Many of the 
genes associated with bipolar disorder are also associated 
with schizophrenia. 

Autism spectrum disorder. Autism spectrum disorder 
(ASD) is characterized by impairments in social interaction 
and communication, as well as restricted, repetitive, and ste¬ 
reotyped behaviors and activities. The definition of ASD has 
been broadened considerably in the past decade or so (e.g., it 
now includes Asperger syndrome), resulting in a much higher 
current prevalence estimate (approximately 1% worldwide). 
ASD is 3-4 times more common in males than in females and 
is usually diagnosed during the first three years of life. As 
predicted by the multifactorial threshold model, the empiri¬ 
cal recurrence risk is substantially higher in the siblings of 
females with ASD. Genetic factors play a large role in the 
etiology of ASD, with twin studies suggesting a heritability of 
approximately 70%. Many large, well-controlled studies have 
investigated the relationship between ASD and vaccination 
(in particular for measles, mumps, and rubella), and no rela¬ 
tionship has been found. 

Approximately 10% of ASD cases have a known mende- 
lian condition or genetic syndrome, most of which have been 
discussed in previous chapters. These include Fragile X syn¬ 
drome (1-2% of cases), Rett syndrome (0.5%), tuberous scle¬ 
rosis (1%), and neurofibromatosis type 1 (<1%). The most 
frequent cytogenetic abnormality seen in persons with ASD 
(1-3%) is a maternally transmitted duplication of chromo¬ 
some 15qll-ql3, the region that is deleted in Prader-Willi 
and Angelman syndromes (see Chapter 5). A 600 kb duplica¬ 
tion or deletion of 16pl 1.2 is seen in about 1% of ASD cases. 


Numerous GWAS and sequencing studies have been 
carried out to identify additional loci associated with ASD, 
and more than 100 susceptibility loci (each with small effect) 
have been identified. An increased number of de novo muta¬ 
tions are seen in children with ASD, most of which are inher¬ 
ited paternally (these are detected by comparing the DNA 
sequences of affected cases with those of their unaffected 
parents). This may help to explain the increased risk of autism 
among the offspring of older fathers. In addition, an increase 
in the number of copy number variants (CNVs) is seen in 
persons with autism. 

Psychiatric disorders, such as schizophrenia, bipolar dis¬ 
order, and ASD, present a number of challenges for genetic 
analysis. These diseases are undoubtedly heterogeneous, 
reflecting the influence of numerous genetic and environ¬ 
mental factors (the same is true of other psychiatric diseases, 
such as attention hyperactivity deficit disorder, which are not 
discussed here). This genetic heterogeneity is to be expected, 
in part because at least one-third of human genes are expressed 
in the brain. Also, the definition of a psychiatric phenotype 
is not always straightforward, and it can change through time. 
Several measures are being taken to improve the likelihood 
of hnding genes underlying these conditions. Phenotypes are 
being defined in standardized and rigorous fashion. Larger 
sample sizes of affected persons (often tens of thousands), 
with more rigorous phenotype dehnition, are being collected 
in efforts to increase the power to detect linkage and associa¬ 
tion. Heterogeneity can be decreased by studying clinically 
defined subtypes of these diseases and by carrying out studies 
in genetically homogeneous populations. 

Marked familial aggregation, as well as familial 
co-occurrence, has been observed for schizo¬ 
phrenia, bipolar disorder, and autism spectrum 
disorder. Case-control and family studies have 
revealed evidence that these conditions are 
caused by brain-expressed genes that encode 
neurotransmitters, receptors, ion channels, 
and neurotransmitter-related enzymes. 

Other Complex Disorders 

The disorders discussed in this chapter represent some of the 
most common multifactorial disorders and those for which 
significant progress has been made in identifying genes. Many 
other multifactorial disorders are being studied as well, and 
in some cases specific susceptibility genes have been identi¬ 
fied. These include, for example, Parkinson disease, hearing 
loss, multiple sclerosis, amyotrophic lateral sclerosis, epilepsy, 
asthma, inflammatory bowel disease, and some forms of 
blindness (see Table 12-6 and Table 8-2 in Chapter 8). 

SOME GENERAL PRINCIPLES 
AND CONCLUSIONS 

Some general principles can be deduced from the results 
obtained thus far regarding the genetics of complex disorders. 
First, the more strongly inherited forms of complex disorders 
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generally have an earlier age of onset (examples include breast 
cancer, Alzheimer disease, and heart disease). Often, these 
include subsets of cases in which there is single-gene inheri¬ 
tance. Second, when there is laterality, bilateral forms some¬ 
times cluster more strongly in families (e.g., cleft lip/palate or 
breast cancer). Third, while the sex-specific threshold model 
fits some of the complex disorders (e.g., pyloric stenosis, cleft 
lip/palate, autism, heart disease), it fails to fit others (e.g., type 
1 diabetes). 

There is a tendency, particularly among the lay public, to 
assume that the presence of a genetic component means that 
the course of a disease cannot be altered (“If it’s genetic, you 
can’t change it”). This is incorrect. Most of the diseases dis¬ 
cussed in this chapter have both genetic and environmental 
components. Thus, environmental modification (e.g., diet, 
exercise, stress reduction) can often reduce risk significantly. 
Such modifications may be especially important for persons 
who have a family history of a disease, because they are likely 
to develop the disease earlier in life. Those with a family 
history of heart disease, for example, can often add many years 


of productive living with relatively minor lifestyle alterations. 
By targeting those who can benefit most from intervention, 
genetics helps to serve the goal of preventive medicine. 

In addition, the identification of a specific genetic altera¬ 
tion can lead to more effective prevention and treatment 
of the disease. Identification of mutations causing familial 
colorectal cancer can allow early screening and prevention of 
metastasis. Pinpointing a gene responsible for a neurotrans¬ 
mitter defect in a psychiatric disorder such as schizophrenia 
could lead to the development of more effective drug treat¬ 
ments. In some cases, such as familial hypercholesterolemia, 
gene therapy may be useful, ft is important for health-care 
practitioners to make their patients aware of these facts. 

Although the genetics of common disorders is complex 
and sometimes confusing, the public health impact of these 
diseases and the evidence for hereditary factors in their etiol¬ 
ogy demand that genetic studies be pursued. Substantial 
progress is already being made. The next decade will undoubt¬ 
edly witness many advancements in our understanding and 
treatment of these disorders. 


I STUDY QUESTIONS 


1. Consider a multifactorial trait that is twice as common in 
females as in males. Indicate which type of mating is at 
higher risk for producing affected children (affected father 
and normal mother versus normal father and affected 
mother). Is the recurrence risk higher for their sons or 
their daughters? 

2. Consider a disease that is known to have a 5% sibling 
recurrence risk. This recurrence risk could be the result of 
either multifactorial inheritance or a single autosomal 
dominant gene with 10% penetrance. How would you test 
which of these possibilities is correct? 


3. One member of a pair of monozygotic twins is affected by 
an autosomal dominant disease, and the other is not. List 
two different ways this could happen. 

4. Suppose that the heritability of body fat percentage is 0.80 
when correlations between siblings are studied but only 
0.50 when correlations between parents and offspring are 
studied. Suppose also that a significant positive correlation 
is observed in the body fat percentages of spouses. How 
would you interpret these results? 
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Genetic Testing and Gene Therapy 


As we have seen in previous chapters, significant advances 
have occurred in many areas of medical genetics, including 
DNA sequencing technology, disease-gene identification, and 
cytogenetics. These developments have paved the way for 
more accurate and efficient testing of genetic disorders. 
Genetic testing can be defined as the analysis of chromo¬ 
somes, DNA, RNA, proteins, or other analytes* to detect 
abnormalities that can cause a genetic disease. Indications for 
genetic testing include prenatal diagnosis, heterozygote 
carrier detection, and presymptomatic diagnosis of genetic 
disease. The principles and applications of genetic testing in 
these contexts are one focus of this chapter. 

The other focus is the treatment of genetic disease. Many 
aspects of disease management involve areas of medicine, 
such as surgery and drug treatment, that are beyond the scope 
of this book. However, gene therapy, in which patients’ cells 
are genetically altered to combat specific diseases, is being 
used increasingly to treat disease and is discussed in the 
second part of this chapter. 

POPULATION SCREENING 
FOR GENETIC DISEASE 

Screening tests represent an important component of routine 
health care. These tests are usually designed to detect treatable 
human diseases in their presymptomatic stage. Papanicolaou 
(Pap) tests for detecting cervical dysplasia and population 
screening for hypercholesterolemia are well-known examples 
of this public health strategy. Population screening is large- 
scale testing of populations for disease in an effort to identify 
persons who probably have the disease and those who prob¬ 
ably do not. Screening tests are not intended to provide defin¬ 
itive diagnoses; rather, they are aimed at identifying a subset 
of the population on whom further, more exact, diagnostic 
tests should be carried out. This important distinction is 
commonly misunderstood by the lay public and seldom clari¬ 
fied by the popular media. 

Genetic screening is population screening for genetic 
variants that can cause the disease in the person carrying the 
variant or in the descendants of the carrier. Newborn screen¬ 
ing for inherited metabolic diseases (see Chapter 7) is a 
good example of the first type of genetic screening, and het¬ 
erozygote detection for Tay-Sachs disease (discussed later) 


*An analyte is any substance that is subject to analysis. 


exemplifies the second. These two examples involve screening 
of populations, but genetic screening can also be applied to 
members of families with a positive history of a genetic con¬ 
dition. An example is testing for a balanced reciprocal trans¬ 
location in families in which one or more members have had 
a chromosome disorder (see Chapter 6). Box 13-1 lists the 
various types of genetic screening, including several forms of 
prenatal diagnosis, that are discussed in this chapter. 

The goal of screening is early recognition of a 
disorder so that intervention will prevent or 
reverse the disease process (as in newborn 
screening for inborn errors of metabolism) or 
so that informed reproductive decisions can be 
made (as in screening for heterozygous carriers 
of an autosomal recessive mutation). A positive 
result from a genetic screening test is typically 
followed by a more precise diagnostic test. 

Principles of Screening 

The basic principles of screening were developed in the 1960s 
and are still widely recognized. Characteristics of the disease, 
the test, and the health-care system should be considered 
when deciding whether population screening is appropriate. 

First, the condition should be serious and relatively 
common. This ensures that the benefits to be derived from 
the screening program will justify its costs. The natural 
history of the disease should be clearly understood. There 
should be an acceptable and effective treatment, or, in the case 
of some genetic conditions, prenatal diagnosis should be 
available. As for the screening test itself, it should be accept¬ 
able to the population, easy to perform, and relatively inex¬ 
pensive. The screening test should be valid and reliable. 
Finally, the resources for diagnosis and treatment of the dis¬ 
order must be accessible. A strategy for communicating 
results efficiently and effectively must be in place. 

Screening programs typically use tests that are widely 
applicable and inexpensive to identify an at-risk population 
(e.g., the phenylketonuria (PKU) screening program, dis¬ 
cussed in Clinical Commentary 13-1). Members of this popu¬ 
lation are then targeted for subsequent tests that are more 
accurate but also more expensive and time consuming. In this 
context, the screening test should be able to effectively sepa¬ 
rate persons who have the disease from those who do not. 
This attribute, which defines the test’s validity, involves two 
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BOX 13-1 Types of Genetic Screening and Prenatal Diagnosis 


Population Screening for Genetic Disorders 
Newborn Screening 

Blood 

• Phenylketonuria, all 50 states in the United States 

• Galactosemia, all 50 states in the United States 

• Hypothyroidism, all 50 states in the United States 

• Hemoglobinopathies, nearly all states 

• Other: cystic fibrosis, congenital adrenal hyperplasia, maple 
syrup urine disease, homocystinuria, tyrosinemia, and 
several other diseases or inborn errors of metabolism are 
screened in the US and Europe 

Universal newborn hearing screening (>60% of congenital 
hearing loss is due to genetic factors) 

Heterozygote Screening 

Tay-Sachs disease, Ashkenazi Jewish population 
Sickle cell disease, African-American population 
Thalassemias in at-risk ethnic groups 

Cystic fibrosis in some populations (persons of European 
descent, Ashkenazi Jews) 

Prenatal Diagnosis of Genetic Disorders 
Diagnostic Testing (invasive Prenatal Diagnosis) 

Amniocentesis 

Chorionic villus sampling 

Percutaneous umbilical blood sampling (PUBS) 

Preimplantation genetic diagnosis 


Fetal Visualization Techniques 

Ultrasonography 

Radiography 

Magnetic resonance imaging 

Population Screening 

Maternal age >35 years 

Family history of condition diagnosable by prenatal techniques 

Quadruple screen: maternal serum a-fetoprotein, estriol, 
human chorionic gonadotropin, inhibin-A 

First trimester screening: ultrasonography, PAPP-A, and free (3 
subunit of human chorionic gonadotropin 

Noninvasive prenatal screening using cell-free DMA from 
maternal plasma 

Family Screening for Genetic Disorders 

Family history of chromosomal rearrangement (e.g., 
translocation) 

Screening female relatives in an X-linked pedigree (e.g., Duch- 
enne muscular dystrophy, fragile X syndrome) 

Heterozygote screening within at-risk families (e.g., cystic 
fibrosis) 

Presymptomatic screening (e.g., Huntington disease, breast 
cancer, colon cancer) 


CLINICAL COMMENTARY 13-1 
Neonatal Screening for Phenylketonuria 

Disease Characteristics 

Population screening of newborns for PKU represents an excel¬ 
lent example of the application of the screening model to 
genetic disease. As discussed in Chapter 4, the prevalence of 
this autosomal recessive disorder of phenylalanine metabolism 
is about 1 in 10,000 to 15,000 white births in the United States. 
The natural history of PKU is well understood. More than 95% 
of untreated PKU patients develop moderate to severe intel¬ 
lectual disability. The condition is difficult to diagnose clinically 
in the first year of life because the physical signs are subtle 
and PKU usually manifests only as developmental delay. 
Dietary restriction of phenylalanine, when begun before 4 
weeks of age, is highly effective in altering the course of the 
disease. A low-phenylalanine diet, although not particularly pal¬ 
atable, largely eliminates the IQ loss that would otherwise 
occur (an important exception is those who have a defect in 
biopterin metabolism, in whom a different therapy is used). 

Test Characteristics 

Until recently PKU was typically detected by the measurement 
of blood phenylalanine using a bacterial inhibition assay, the 
Guthrie test. Blood is collected in the newborn period, usually 
by heel stick, and placed on filter paper. The dried blood is 
placed on an agar plate and incubated with a strain of bacteria 
(Bacillus subtllis) that requires phenylalanine for growth. Mea¬ 
surement of bacterial growth permits quantification of the 
amount of phenylalanine in the blood sample. Tandem mass 
spectrometry (see below) is now usually used to screen for 


PKU and a number of other inborn errors of metabolism. Posi¬ 
tive test results are repeated and followed by a quantitative 
assay of plasma phenylalanine and tyrosine. 

If the test is performed after 2 days of age and after regular 
feeding on a protein diet, the detection rate (sensitivity) is about 
98%. If it is performed at less than 24 hours of age, the sensitiv¬ 
ity is about 84% and for this reason a repeat test is recom¬ 
mended a few weeks after birth. Specificity is close to 100%. 

System Characteristics 

Because of the requirement of normal protein in the diet, many 
states request rescreening at 2 to 4 weeks of age. At that point, 
sensitivity approaches 100%. A high sensitivity level is impor¬ 
tant because of the severe impact of a missed diagnosis. 

Phenylalanine levels in children with classic PKU typically 
exceed 20 mg/dL. For every 20 positive PKU screening results, 
only one infant has classic PKU. The others represent either 
false-positive findings (usually due to a transient reversible 
tyrosinemia) or infants with a form of hyperphenylalaninemia 
(elevated phenylalanine) not caused by classic PKU. 

The cost of PKU screening is typically less than a few dollars 
per test. Several studies have shown that the cost of nation¬ 
wide PKU screening is significantly less than the savings it 
achieves by avoiding institutionalization costs and lost 
productivity. 

These principles apply to many of the other inborn errors that 
are now screened for in the expanded newborn screening 
programs in North America and Europe (see Chapter 7). 
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components: sensitivity and specificity. Sensitivity reflects 
the ability of the test to correctly identify those with the 
disease. It is measured as the fraction of affected persons in 
whom the test is positive (i.e., true positives). Specificity is 
the ability of the test to correctly identify those without the 
disease. It is measured as the fraction of unaffected persons 
in whom the test is negative (i.e., true negatives). Sensitivity 
and specificity are determined by comparing the screening 
results with those of a definitive diagnostic test (Table 13-1). 

Screening tests are seldom, if ever, 100% sensitive and 
100% specific. This is because the range of test values in the 
disease population overlaps that of the unaffected population 
(Fig. 13-1). Thus, results of a screening test (as opposed to 
the definitive follow-up diagnostic test) will be incorrect 
for some members of the population. Usually a cutoff value 
is designated to separate the diseased and nondiseased 
portions of the population. A tradeoff exists between the 
impact of nondetection or low sensitivity (i.e., an increased 
false-negative rate) and the impact of low specificity (an 
increased false-positive rate). If the penalty for missing 
affected persons is high (as in untreated PKU), then the cutoff 
level is lowered so that nearly all disease cases will be detected 
(higher sensitivity). This also lowers the specificity by increas¬ 
ing the number of unaffected persons with positive test 


TABLE 13-1 

and Specificity 

Definitions of Sensitivity 

RESULT OF 

ACTUAL DISEASE STATE | 

SCREENING TEST 

AFFECTED 

UNAFFECTED 

Test positive (-t-) 

Test negative (-) 

a (true positives) 
c (false negatives) 

b (false positives) 
d (true negatives) 


*a, b, c, and d represent the number of individuals in a population 
who were found to have the disease and test result combinations 
shown. The test sensitivity = a/(a + c); specificity = d/(b + d); 
positive predictive value = a/(a + b); and negative predictive value = 
d/(c + d). 


results (false positives) who are targeted for subsequent diag¬ 
nostic testing. If confirmation of a positive test is expensive 
or hazardous, then false-positive rates are minimized (i.e., the 
cutoff level is increased, producing high specificity at the 
expense of sensitivity). 

The basic elements of a test's validity include 
its sensitivity (proportion of true positives accu¬ 
rately detected) and specificity (proportion of 
true negatives accurately detected). When sen¬ 
sitivity is increased, specificity decreases, and 
vice versa. 

A primary concern in the clinical setting is the accuracy 
of a positive screening test. One needs to know the fraction 
of persons with a positive test result who truly have the 
disease in question (i.e., a/(a-l- b) in Table 13-1). This quantity 
is defined as the positive predictive value. It is also useful to 
know the negative predictive value, which is the fraction of 
persons with a negative result who truly do not have the 
disease (d/(c -r d)). 

The concepts of sensitivity, specificity, and positive predic¬ 
tive value can be illustrated by an example. Congenital adrenal 
hyperplasia (CAH) due to a deficiency of 21-hydroxylase is 
an inborn error of steroid biosynthesis that can produce 
ambiguous genitalia in females and adrenal crises in both 
sexes. The screening test, a 17-hydroxyprogesterone assay, 
has a sensitivity of about 95% and a specificity of 99% 
(Table 13-2). The prevalence of CAH is about 1/10,000 in 
most white populations, but it rises to about 1/400 in the 
Yupik Native Alaskan population. 

Let us assume that a screening program for CAH has been 
developed in both of these populations. In a population of 
500,000 whites, the false-positive rate (1—specificity) is 1%. 
Thus, about 5000 unaffected persons will have a positive test. 
With 95% sensitivity, 47 of the 50 white persons who have 
CAH will be detected through a positive test. Note that the 




Creatine kinase (mU/mL) 

FIG 13-1 The distribution of creatine kinase (CK) in normal women and in women who are het¬ 
erozygous carriers for a mutation in the Duchenne muscular dystrophy gene. Note the overlap 
in distribution between the two groups: About two thirds of carriers have CK levels that exceed 
the 95th percentile in normal women. If the 95th percentile is used as a cutoff to identify carri¬ 
ers, then the sensitivity of the test is 67% (i.e., two thirds of carriers will be detected), and the 
specificity is 95% (i.e., 95% of normal women will be correctly identified). 
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TABLE 13-2 Hypothetical Results of 
Screening for Congenital Adrenal 
Hyperplasia in a Low-Prevalence White 
Population and in a High-Prevalence 


Yupik Population 



RESULT OF 

SCREENING TEST 

CAH PRESENT 

CAH ABSENT 

Positive 

White 

47 

5000 

Yupik 

24 

100 

Negative 

White 

3 

494,950 

Yupik 

1 

9875 


*White positive predictive vaiue = 47/(47 + 5000) = 1 %; Yupik 
positive predictive value = 24/(24 + 100) = 19%. CAH, congenitai 
adrenai hyperplasia. 


great majority of people who have a positive test result would 
not have CAH: the positive predictive value is 47/(47 + 5000), 
or less than 1%. Now suppose that 10,000 members of the 
Yupik population are screened for CAH. As Table 13-2 shows, 
24 of 25 persons with CAH will test positive, and 100 persons 
without CAH will also test positive. Here, the positive predic¬ 
tive value is much higher than in the white population: 24/ 
(24 + 100) = 19%. This example illustrates an important 
principle: For a given level of sensitivity and specificity, the 
positive predictive value of a test increases as the prevalence of 
the disease increases. 

The positive predictive value of a screening 
test is defined as the percentage of positive 
tests that are true positives. It increases as the 
prevalence of the target disorder increases. 

Newborn Screening for Inborn 
Errors of Metabolism 

Newborn screening programs represent an ideal opportunity 
for presymptomatic detection and prevention of genetic 
disease. At present, all states in the United States screen new¬ 
borns for PKU, galactosemia (see Chapter 7), and hypothy¬ 
roidism. All of these conditions fulfill the previously stated 
criteria for population screening. Each is a disorder in which 
the person is at significant risk for intellectual disability, 
which can be prevented by early detection and effective 
intervention. 

In recent years, most states of the United States and many 
other nations have instituted screening programs to identify 
neonates with hemoglobin disorders (e.g., sickle cell disease). 
These programs are justified by the fact that up to 15% of 
untreated children with sickle cell disease die from infections 
before 5 years of age (see Chapter 3). Effective treatment, in 
the form of prophylactic antibiotics, is available. Some com¬ 
munities have begun screening for Duchenne muscular 


dystrophy by measuring creatine kinase levels in newborns. 
The objective is not presymptomatic treatment; rather it is 
identification of families who should receive genetic counsel¬ 
ing in order to make informed reproductive decisions. 

Increasingly, tandem mass spectrometry is used to screen 
newborns for protein variants that signal amino acid disor¬ 
ders (e.g., PKU, tyrosinemia, homocystinuria), organic acid 
disorders, and fatty acid oxidation disorders (e.g., MCAD and 
TCHAD deficiencies; see Chapter 7). This method begins 
with a sample of material from a dried blood spot, which is 
subjected to analysis by two mass spectrometers. The first 
spectrometer separates ionized molecules according to their 
mass, and the molecules are fragmented. The second spec¬ 
trometer assesses the mass and charge of these fragments, 
which allows a computer to generate a molecular profile of 
the sample. Tandem mass spectrometry is highly accurate and 
very rapid: more than 30 disorders can be screened in approx¬ 
imately 2 minutes. 

As screening technologies have continued to advance, 
many countries have established expanded newborn screen¬ 
ing programs to deal with positive results and to provide 
rapid treatment of verified disease conditions. The American 
College of Medical Genetics and Genomics has developed 
guidelines and action plans for all of the conditions that are 
currently screened (https://www.acmg.net) and is consis¬ 
tently developing new plans for disorders for which screening 
is being considered (e.g., Pompe disease). Conditions for 
which newborn screening is commonly performed are sum¬ 
marized in Table 13-3. 

Newborn screening is an effective public 
health strategy for treatable disorders such as 
PKU, hypothyroidism, galactosemia, and sickle 
cell disease. The use of tandem mass spec¬ 
trometry has expanded the number of 
diseases that can be detected by newborn 
screening. 

Heterozygote Screening 

The aforementioned principles of population screening can 
be applied to the detection of unaffected carriers of disease- 
causing mutations. The target population is a group known 
to be at risk. The intervention consists of the presentation of 
risk figures and options such as prenatal diagnosis. Genetic 
diseases amenable to heterozygote screening are typically 
autosomal recessive disorders for which prenatal diagnosis 
and genetic counseling are available, feasible, and accurate. 

An example of a highly successful heterozygote screening 
effort is the Tay-Sachs screening program in North Tkmerica. 
Infantile Tay-Sachs disease is an autosomal recessive lyso¬ 
somal storage disorder in which the lysosomal enzyme 
[J-hexosaminidase A (HEX A) is deficient (see Chapter 7), 
causing a buildup of the substrate, Gm 2 ganglioside, in neu¬ 
ronal lysosomes. The accumulation of this substrate damages 
the neurons and leads to blindness, seizures, hypotonia, and 
death by about 5 years of age. Tay-Sachs disease is especially 
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TABLE 13-3 

Characteristics of Selected Newborn Screening Programs 

DISEASE 

INHERITANCE 

PREVALENCE 

SCREENING TEST 

TREATMENT 

Phenylketonuria 

Autosomal recessive 

1/10,000-1/15,000 

Tandem mass spectrometry 

Dietary restriction of 
phenylalanine 

Galactosemia 

Autosomal recessive 

1/50,000-1/100,000 

Transferase assay 

Dietary restriction of galactose 

Congenital 

hypothyroidism 

Usually sporadic 

1/5000 

Measurement of thyroxine 
(TJ or thyroid stimulating 
hormone (TSH) 

Hormone replacement 

Sickle cell disease 

Autosomal recessive 

1/400-1/600 blacks 

Isoelectric focusing or DNA 
diagnosis 

Prophylactic penicillin 

Cystic fibrosis 

Autosomal recessive 

1/2500 

Immunoreactive trypsinogen 
confirmed by DNA diagnosis 

Antibiotics, chest physical 
therapy, pancreatic enzyme 
replacement if needed 


Data from guidelines of the American College of Medical Genetics, http://www.acmg.net/resources/policies/ACT/condition-analyte-links.htm. 


TABLE 13-4 

Ethnic Groups 

Selected Examples of Heterozygote Screening Programs in Specific 

DISEASE 

ETHNIC GROUP 

CARRIER FREQUENCY 

AT-RISK COUPLE 
FREQUENCY 

DISEASE INCIDENCE 
IN NEWBORNS 

Sickle cell disease 

Blacks 

1/12 

1/150 

1/600 

Tay-Sachs disease 

Ashkenazi Jews 

1/30 

1/900 

1/3600 

|3-Thalassemia 

Greeks, Italians 

1/30 

1/900 

1/3600 

a-Thalassemia 

Southeast Asians, Chinese 

1/25 

1/625 

1/2500 

Cystic fibrosis 

Northern Europeans 

1/25 

1/625 

1/2500 

Phenylketonuria 

Northern Europeans 

1/50 

1/2500 

1/10,000 


Modified from McGinniss MJ, Kaback MM. Carrier screening. In: Rimoin DL, Conner JM, Pyeritz RE, Korf BR, eds. Emery and Rimoin's 
Principles and Practice of Medical Genetics, 5th ed. New York: Churchill Livingstone; 2007, pp 752-762. 


common among Ashkenazi Jews, with a heterozygote fre¬ 
quency of about 1 in 30. Thus, this population is a reasonable 
candidate for heterozygote screening. Accurate carrier testing 
is available (assays for HEX A or direct DNA testing for muta¬ 
tions). Because the disease is uniformly fatal, options such as 
pregnancy termination or artificial insemination by noncar¬ 
rier donors are acceptable to many couples. A well-planned 
effort was made to educate members of the target population 
about risks, testing, and available options. As a result of 
screening, the number of Tay-Sachs disease births among 
Ashkenazi Jews in the United States and Canada declined by 
90%, from 30 to 40 per year before 1970, to 3 to 5 per year 
in the 1990s, and to zero by 2003. 

P-Thalassemia major, another serious autosomal recessive 
disorder, is especially common among many Mediterranean 
and South Asian populations (see Chapter 3). Effective carrier 
screening programs have produced a 75% decrease in the 
prevalence of newborns with this disorder in Greece, Cyprus, 
and Italy. Carrier screening is also possible for cystic fibrosis, 
another autosomal recessive disorder (Clinical Commentary 
13-2). Table 13-4 presents a list of selected conditions for 
which heterozygote screening programs have been developed 
in many countries. 

In addition to the criteria for establishing a population 
screening program for genetic disorders, guidelines have 
been developed regarding the ethical and legal aspects of 


BOX 13-2 Public Policy Guidelines for 
Heterozygote Screening 


Recommended guidelines: 

Screening should be voluntary, and confidentiality must be 
ensured. 

Screening requires informed consent. 

Providers of screening services have an obligation to ensure 
that adequate education and counseling are included in the 
program. 

Quality control of all aspects of the laboratory testing, includ¬ 
ing systematic proficiency testing, is required and should 
be implemented as soon as possible. 

There should be equal access to testing. 

From Elias S, Annas GJ, Simpson JL. Carrier screening for cystic 
fibrosis: implications for obstetric and gynecologic practice. Am J 
Obstet Gyneco/1991;164:1077-1083. 


heterozygote screening programs. These are summarized in 
Box 13-2. 

Heterozygote screening consists of testing (at 
the phenotype or genotype level) a target popu¬ 
lation to identify unaffected carriers of a disease 
gene. The carriers can then be given informa¬ 
tion about risks and reproductive options. 
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CLINICAL COMMENTARY 13-2 
Population Screening for Cystic Fibrosis 

Approximately 2,000 mutations have now been reported in the 
CFTR gene, and, although some are benign variants, most can 
cause cystic fibrosis. It would be technologically impractical to 
test for all reported mutations in a population screening 
program. However, among the mutations that can cause CF in 
whites, about 70% are the three-base deletion termed F508del 
(see Chapter 4). In this population, carrier screening using PCR- 
based detection of F508del would detect approximately 90% 
of couples in which one or both are heterozygous carriers of 
this mutation (1-0.30^, where 0.30^ represents the frequency 
of carrier couples in whom neither carries the F508del muta¬ 
tion). It is now recommended to test simultaneously for 
25 of the most common CFTR mutations, which will detect 
approximately 85% of all CFTR mutations in persons of Euro¬ 
pean descent (because mutation frequencies vary among 


populations, this figure is somewhat lower in other U.S. popula¬ 
tions, such as African-Americans and Hispanics). Among 
whites, 98% of couples in which one or both members carry 
a CF mutation would be recognized (i.e., 1-0.15^), yielding a 
high level of sensitivity. The American College of Medical 
Genetics and Genomics and the American College of Obstetri¬ 
cians and Gynecologists recommend that couples who are 
planning a pregnancy, or who are currently pregnant, should 
be offered a screen for CF carrier status. Couples in which both 
parents are heterozygotes would define a subset of the popula¬ 
tion in which prenatal diagnosis for CF might be offered. CF is 
now commonly screened in the newborn population (expanded 
newborn screening, see above), typically using an immunore- 
active trypsinogen assay, followed, if indicated, by direct 
testing for CFTR mutations. 


Presymptomatic Diagnosis 

With the development of genetic diagnosis by direct muta¬ 
tion detection, presymptomatic diagnosis has become fea¬ 
sible for many genetic diseases. At-risk persons can be tested 
to determine whether they have inherited a disease-causing 
mutation before they develop clinical symptoms of the dis¬ 
order. Presymptomatic diagnosis is available, for example, for 
Huntington disease, adult polycystic kidney disease, and 
autosomal dominant breast/ovarian cancer. By informing 
persons that they do or do not carry a disease-causing muta¬ 
tion, presymptomatic diagnosis can aid in making reproduc¬ 
tive decisions. It can provide reassurance to those who 
learn that they do not carry a disease-causing mutation. 
In some cases, early diagnosis can improve health supervi¬ 
sion. For example, persons who inherit an autosomal domi¬ 
nant breast cancer mutation can undergo mammography at 
an earlier age to increase the chances of early tumor detec¬ 
tion. Persons at risk for the inheritance of RET mutations 
(see Chapter 11), who are highly likely to develop multiple 
endocrine neoplasia type 2 (MEN2), can undergo a prophy¬ 
lactic thyroidectomy to reduce their chance of developing 
a malignancy. Those who inherit mutations that cause 
some forms of familial colon cancer (adenomatous polyposis 
coli [APC] and hereditary nonpolyposis colorectal cancer 
[HNPCC]; see Chapter 11) can also benefit from early diag¬ 
nosis and treatment. 

Because most genetic diseases are relatively uncommon in 
the general population, universal presymptomatic screening 
is currently impractical. It is usually recommended only for 
persons who are known to be at risk for the disease, generally 
because of a positive family history. 

Genetic testing can sometimes be performed 
to identify persons who have inherited a 
disease-causing gene before they develop 
symptoms. This is termed presymptomatic 
diagnosis. 


Psychosocial Implications of Genetic 
Screening and Diagnosis 

Screening for genetic diseases has many social and psycho¬ 
logical implications. The burden of anxiety, cost, and poten¬ 
tial stigmatization surrounding a positive test result must be 
weighed against the need for detection. Often, screening tests 
are misperceived as diagnostically definitive. The concept that 
a positive screening test does not necessarily indicate disease 
presence must be emphasized to those who undergo screen¬ 
ing (see Box 13-2). 

The initial screening programs for sickle cell carrier status in 
the 1970s were plagued by misunderstandings about the impfi- 
cations of carrier status. Occasionally, carrier detection led to 
cancellation of health insurance or employer discrimination. 
Such experiences underscore the need for effective genetic coun¬ 
seling and public education. Other issues include the right to 
choose not to be tested and the potential for invasion of privacy. 

The social, psychological, and ethical aspects of genetic 
screening will become more complicated as newer techniques 
of DNA diagnosis become more accessible. For example, even 
though presymptomatic diagnosis of Huntington disease is 
available, several studies have shown that fewer than 20% of 
at-risk persons elected this option. Largely, this reflects the 
fact that no effective treatment is currently available for this 
disorder. Presymptomatic diagnosis of BRCAl and BRCA2 
(see Chapter 11) carrier status has also met with mixed 
responses. In part, this has been a reaction to the cost of the 
test: because of the large number of different mutations in 
BRCAl or BRCA2 that can cause breast or ovarian cancer, 
testing typically consists of sequencing all exons and promot¬ 
ers of both genes, as well as some intronic nucleotides near 
each exon. This has been a costly procedure, although new 
technologies and increasing competition are now decreasing 
the cost. Preventive measures such as prophylactic mastec¬ 
tomy and oophorectomy (removal of the breasts and ovaries, 
respectively) are known to reduce cancer risk by approxi¬ 
mately 90%, but they do not completely eliminate it. 









CHAPTER 13 Genetic Testing and Gene Therapy 


271 


For some genetic diseases, such as autosomal dominant 
colon cancer syndromes, early diagnosis can lead to improved 
survival because effective preventive treatments are readily 
available (colectomy or polypectomy for precancerous colon 
polyps). In addition, many at-risk persons will find that they 
do not carry a disease-causing allele, allowing them to avoid 
unpleasant (and possibly hazardous) diagnostic procedures 
such as colonoscopy or mammography. However, as screen¬ 
ing for such diseases becomes more common, the issues of 
privacy and confidentiality and the need for accurate com¬ 
munication of risk information must also be addressed. 

MOLECULAR TOOLS FOR SCREENING 
AND DIAGNOSIS 

In the past, genetic screening usually relied on assays of 
the disease phenotype, such as a (J-hexosaminidase assay for 
Tay-Sachs disease or a creatine kinase assay for Duchenne 
muscular dystrophy. Advances in DNA technology have 
led to diagnosis at the level of the genotype. In most 
cases direct assays of disease-causing mutations have 
been developed. Genetic diagnosis at the DNA level is now 
supplementing, and in many cases supplanting, tests based 
on phenotypic assays. 

Linkage analysis and direct mutation diagnosis have been 
used for diagnostic testing within families, for prenatal diag¬ 
nosis of genetic disorders, and, more recently, in population 
screening. Improved technology and an increased demand 
for testing have led to the establishment of clinical molecular 
laboratories in many medical centers throughout the world. 

Linkage Analysis 

DNA polymorphisms such as SNPs or STRs can be used as 
markers in linkage analysis, as described in Chapter 8. Once 
linkage phase is established in a family, the marker locus can 
be assayed to determine whether an at-risk person has inher¬ 
ited a chromosome segment that contains a disease-causing 
mutation or a homologous segment that contains a normal 
allele (Fig. 13-2). Because this approach uses linked markers 
but does not involve direct examination of the disease-causing 
mutations, it is a form of indirect diagnosis. 

Linkage analysis has been employed successfully in diag¬ 
nosing many of the genetic diseases discussed in this text. In 
principle, it can be used to diagnose any mapped genetic 
disease. It has the advantages that the disease-causing gene 
and its product need not be known. The marker simply tells 
us whether or not the at-risk person has inherited the chro¬ 
mosome region that contains a disease-causing mutation. 

The disadvantages of this approach are that several family 
members must be tested in order to establish linkage phase; 
not all markers are informative (sufficiently heterozygous) in 
all families (see Chapter 8 for a discussion of uninformative 
families); and recombination can occur between the marker 
and the disease-causing mutation, introducing a source of 
diagnostic error. With advances in direct mutation detection, 
indirect diagnosis with linked markers is applied less com¬ 
monly than in the past, but it is still useful when a 



FIG 13-2 In this pedigree for autosomal dominant breast 
cancer, the analysis of a closely linked marker on chromo¬ 
some 17 shows that the mutation is on the same chromo¬ 
some as marker allele 1 in the affected mother in generation 
II. This indicates that the daughter in generation III has inher¬ 
ited the mutation-bearing chromosome from her mother and 
is highly likely to develop a breast tumor. 


disease-causing mutation is being transmitted in a family but 
cannot be directly identified. 

Linkage analysis, a form of indirect genetic 
diagnosis, uses linked markers to determine 
whether a person has inherited a chromosome 
containing a disease gene from his or her 
parent. The need fortyping of numerous family 
members and the possibilities of recombina¬ 
tion and uninformative matings are disadvan¬ 
tages of this approach. 

Direct Mutation Analysis 

Direct diagnosis, in which the disease-causing mutation 
itself is detected, has become the approach used most com¬ 
monly in genetic diagnosis. Compared to indirect diagnosis, 
it has the advantages that family information is not needed 
(the mutation is viewed directly in each individual), uninfor¬ 
mative matings are not a problem, and there is no error 
resulting from recombination. (Table 13-5 summarizes the 
advantages and disadvantages of direct and indirect diagno¬ 
sis.) Several forms of direct diagnosis (Table 13-6) have been 
described in preceding chapters. For example, fluorescence in 
situ hybridization (FISH) or array comparative genomic 
hybridization (array CGH) can be used to detect deletions or 
duplications of chromosome regions (see Chapter 6). South¬ 
ern blotting or multiplex ligation-dependent probe amplifi¬ 
cation (MLPA) can be used to detect copy number variants 
or large repeat expansions (see Chapter 3). 

If the DNA sequence surrounding a mutation is known, 
the region containing the sequence can be amplified by PCR 
(see Chapter 3) and an oligonucleotide probe can be synthe¬ 
sized that will hybridize (undergo complementary base 
pairing) only to the mutated sequence (such probes are often 
termed allele-specific oligonucleotides, or ASOs). A second 
probe that will hybridize to the normal DNA sequence is also 
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synthesized. Stringent hybridization conditions are used so 
that a one-base mismatch will prevent hybridization. DNA 
from persons who are homozygous for the mutation hybrid¬ 
izes only with the ASO containing the mutated sequence, 
whereas DNA from persons homozygous for the normal 
sequence hybridizes with the normal ASO. DNA from hetero¬ 
zygotes hybridizes with both probes (Fig. 13-3). The length 
of the ASO probes, usually about 18 to 20 nucleotides, is 
critical. Shorter probes would not be unique in the genome 
and would therefore hybridize to multiple regions. Longer 
probes are more difficult to synthesize correctly and could 
hybridize to both the normal and the mutated sequence. 


TABLE 13-5 Summary of Attributes of 

Indirect and Direct Diagnosis 



INDIRECT 

DIRECT 

ATTRIBUTE 

DIAGNOSIS 

DIAGNOSIS 

Family information needed 

Yes 

No 

Errors possible because of 

Yes 

No 

recombination 

Markers may be 

Yes 

No 

uninformative 

Single test can uncover 

Yes 

No 

multiple mutations 
Disease-causing mutation 

No 

Yes 

must be known 




Genetic diagnosis using ASO probes requires that the 
disease-causing gene has been identified and sequenced. In 
addition, each disease-causing mutation requires a different 
oligonucleotide probe. For this reason, this approach, while 
powerful, can become more challenging if the disease can be 
caused by a large number of different mutations that each 
have low frequency in the population. 

Direct genetic diagnosis is accomplished by 
assaying the disease-causing mutation itself. It 
is potentially more accurate than indirect diag¬ 
nosis and does not require family information. 
Direct diagnosis can be performed by hybrid¬ 
ization of a person's DNA with allele-specific 
oligonucleotide probes. This approach is fea¬ 
sible if the DNA sequence causing a genetic 
disease is known and if the number of disease- 
causing mutations is limited. 

Examples of diseases caused by a limited number of muta¬ 
tions include sickle cell disease and Oi-antitrypsin deficiency 
(Clinical Commentary 13-3). Although approximately 2,000 
cystic fibrosis mutations have been identified, 25 of the most 
common ones account for the great majority of mutations in 
many populations (see Clinical Commentary 13-2). Thus, 
direct diagnosis can be used to identify most cystic fibrosis 
homozygotes and heterozygous carriers. Prenatal diagnosis is 



TABLE 13-6 

Methods of Direct Genetic Testing 





COPY NUMBER 

VARIANTS 

(INCLUDING 





SINGLE-GENE 

DELETIONS AND 

UNIPARENTAL 

REPEAT 


METHOD 

MUTATIONS 

DUPLICATIONS) 

DISOMY 

EXPANSIONS 

EXAMPLES OF APPLICATIONS 

FISH 


X 



Prader-Willi syndrome: Williams 






syndrome: aneuploidy 

Array CGH 


X 

X 


Intellectual disability or other 






conditions of unknown etiology: 
suspected microdeletions, 
microduplications, or aneuploidy 

SNP microarrays 

X 

X 



Drug sensitivity: intellectual 






disability: cardiovascular 
disease risk 

Allele-specific 

X 



X 

Sickle cell disease: ai-antitrypsin 

oligonucleotide 
(ASO) hybridization 





deficiency 

Sanger DNA 

X 




Familial breast/ovarian cancer 

sequencing 
Southern blot or 


X 


X 

Myotonic dystrophy: fragile X 

MLPA 





syndrome 

Targeted disease¬ 

X 




Familial cardiomyopathy: long QT 

sequencing panels 





syndrome: familial colorectal 






cancer 

Whole-exome or 

X 

X 



Undiagnosed conditions or 

whole-genome 

sequencing 





diseases of uncertain etiology 


Adapted from Katsanis SH, Katsanis N. Molecular genetic testing and the future of ciinicai genomics. Nat Rev Genet. 2013:14:415-426. 
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GTG CAC CTG ACT CCT GAG GAG Normal probe 

CAC GTG GAG TGA GGA CTC CTC Normal p-globln 

sequence 
(antisense strand) 


Probe containing 
mutation 

Missense mutation in 
P-globin sequence 

Missense mutation 


GTG CAC CTG ACT CCT GTG GAG 


CAC GTG GAC TGA GGA CAC CTC 


/ 


A 





Probe with normal 
sequence 

o 

o 

o 

o 

Probe with mutated 
sequence 

o 

o 


o o 

B 


Probe hybridization pattern 


FIG 13-3 A, A 21-bp allele-specific oligonucleotide (ASO) 
probe (yellow) is constructed to undergo complementary 
base pairing only with the normal P-globln sequence, and 
another ASO probe (green) Is constructed to undergo com¬ 
plementary base pairing only with a P-globin sequence that 
contains a missense mutation that produces a substitution of 
valine for glutamic acid at position 6 of the P-globin polypep¬ 
tide (see Chapter 3), causing sickle cell disease in homozy¬ 
gotes. B, In this family, the parents are both heterozygous 
carriers of the missense mutation, so their DNA hybridizes 
to both ASO probes. The first female offspring has a homo¬ 
zygous normal genotype, the second offspring is heterozy¬ 
gous, and third offspring is an affected homozygote. A variety 
of methods, including microarrays (see Chapter 3), can be 
used to detect these ASO hybridization patterns. 


also possible. Direct diagnosis can also be used to detect dele¬ 
tions or duplications (e.g., those of the dystrophin gene that 
cause most cases of Duchenne muscular dystrophy). Cur¬ 
rently, clinical genetic testing is available for more than 4000 
genetic diseases, including almost all of the single-gene con¬ 
ditions discussed in this textbook. Despite this wide avail¬ 
ability, it should be kept in mind that genetic testing, like all 
testing procedures, has a number of limitations (Box 13-3). 

Microarrays, in which large numbers of oligonucleotide 
probes on a silicon chip are hybridized with labeled patient 
DNA (see Chapter 3), represent a large-scale extension of the 
ASO technology just described. Microarrays have many con¬ 
venient properties, including miniaturization and automated 
computerized processing. They can be designed to assay large 
numbers of sequence variants (including disease-causing 
mutations) in a single rapid analysis (Box 13-4). For example, 
one microarray contains thousands of oligonucleotide probes 
that hybridize to large numbers of possible sequence variants 
in the CYP2D6 and CYP2C19 genes. The products of these 


CLINICAL COMMENTARY 13-3 

The Genetic Diagnosis of a,-Antitrypsin Deficiency 

An Inherited deficiency of ai-antitrypsin (AAT) is one of the 
most common autosomal recessive disorders among whites, 
affecting approximately 1 in 2000 to 1 in 5000. AAT, synthe¬ 
sized primarily in the liver, is a serine protease inhibitor. As 
its name suggests, it binds trypsin. However, AAT binds 
much more strongly to neutrophil elastase, a protease that 
is produced by neutrophils (a type of leukocyte) in response 
to infections and irritants. It carries out its binding and inhibi¬ 
tory role primarily in the lower respiratory tract, where it 
prevents neutrophil elastase from digesting the alveolar 
septa of the lung. 

Persons with less than 10% to 15% of the normal level of 
AAT activity experience significant lung damage and typically 
develop emphysema during their 30s, 40s, or 50s. In addi¬ 
tion, at least 10% develop liver cirrhosis as a result of the 
accumulation of variant AAT molecules in the liver. AAT defi¬ 
ciency accounts for almost 20% of all nonalcoholic liver cir¬ 
rhosis in the United States. Cigarette smokers with AAT 
deficiency develop emphysema much earlier than do non- 
smokers because cigarette smoke irritates lung tissue, 
increasing secretion of neutrophil elastase. At the same time, 
it inactivates AAT, so there is also less inhibition of elastase. 
One study showed that the median age of death of nonsmok¬ 
ers with AAT deficiency was 62 years, whereas it was 40 
years for smokers with this disease. The combination of ciga¬ 
rette smoking (an environmental factor) and an AAT mutation 
(a genetic factor) produces more-severe disease than either 
factor alone; thus, this is an example of a gene-environment 
interaction. 

Typically, AAT deficiency is tested first by a straightforward 
assay for reduced serum AAT concentration. Because a variety 
of conditions can reduce serum AAT, additional testing, via a 
type of protein electrophoresis (see Chapter 3) or DNA testing, 
is carried out to confirm a diagnosis of AAT deficiency. Direct 
DNA testing became feasible with the identification of 
SERPINAI, the gene that encodes AAT. More than 100 
SERPINA1 mutations have been identified, but only two mis¬ 
sense variants, labeled the Sand Zalleles, are common and 
clinically significant. Approximately ninety-five percent of 
cases of AAT deficiency are either ZZ homozygotes or SZ 
compound heterozygotes. The latter genotype generally pro¬ 
duces less severe disease symptoms. Two large studies have 
indicated that the risk of developing emphysema among ZZ 
homozygotes is 70% for nonsmokers and 90% for smokers. 
Because the great majority of AAT cases are caused by two 
alleles, Zand S, this disease can be diagnosed efficiently and 
with 95% sensitivity by allele-specific DNA testing. If neces¬ 
sary, rare disease-causing mutations can be detected by 
sequencing the entire gene. 


genes influence the rate of metabolism of about 25% of all 
prescription drugs, and analysis of their variation might help 
to predict how individual patients will respond to these drugs. 

Although microarray technology is relatively inexpensive 
and efficient, it allows only the detection of the DNA variants 
present in the probes included in the microarray platform. 
This drawback can be overcome with high-throughput 
DNA sequencing (see Chapters 3 and 8), which has become 
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BOX 13-3 Limitations of Genetic Testing 


Although genetic testing offers many advantages, its limita¬ 
tions must also be borne in mind. These limitations can be 
summarized as follows: 

• No genetic test is 100% accurate. Although most genetic 
tests do achieve a high level of accuracy, factors such as 
mosaicism can complicate cytogenetic diagnosis, and geno- 
typing errors can occur in the diagnosis of single-gene 
disorders. 

• Genetic tests reveal mutations, not the presence of disease, 
because many disease-causing mutations have incomplete 
penetrance. For example, approximately 50% to 80% of 
women with BRCAl or BRCA2 mutations develop breast 
cancer, and 70% to 90% of persons with mutations in one 
of the HNPCC genes develop colorectal cancer. Even when 
penetrance approaches 100% (as in neurofibromatosis type 
1 or Huntington disease), detection of the mutation often 
reveals little about the severity or age of onset of the disease. 

• Genetic testing might not detect all of the mutations that 
can cause a disease. Even in the absence of genotyping or 
sequencing errors, many genetic tests lack sensitivity. For 
example, the panels commonly used to test for cystic fibro¬ 
sis mutations have typically less than 90% sensitivity to 
detect homozygotes (see Clinical Commentary 13-2). When 


a large number of different mutations can produce a genetic 
disease (e.g., neurofibromatosis, autosomal dominant breast 
cancer, Marfan syndrome), it might not be practical to test 
for all possible mutations. In this case, the analysis of linked 
markers can provide additional diagnostic accuracy if multi¬ 
ple family members are affected. Other factors that can 
reduce accuracy include locus heterogeneity and the pres¬ 
ence of phenocopies. 

Genetic testing can lead to complex ethical and social con¬ 
siderations. The results of a genetic test might lead to stig¬ 
matization or to discrimination by employers or insurance 
companies. (However, a federal law, the Genetic Information 
Nondiscrimination Act, now prohibits discrimination by 
employers or health insurance providers based on the results 
of genetic testing [see Chapter 15].) Effective treatment is 
not available for some genetic diseases (e.g., Huntington 
disease, familial Alzheimer disease), decreasing the value of 
early diagnosis through genetic testing. Because genes are 
shared in families, the results of a genetic test might affect 
not only the tested person but also other members of the 
family (who might not wish to know about their risk for a 
genetic disease). These and other ethical and social issues 
are discussed further in Chapter 15. 


BOX 13-4 Direct-to-Consumer Genetic Testing 


Several private companies now offer microarray-based genetic 
testing on a direct-to-consumer basis. Typically, the customer 
collects and submits a cheek swab or mouthwash. DNA is 
extracted from the sample and hybridized to a microarray that 
can test simultaneously for a large number of DNA variants, 
including some of the variants associated, for example, with 
cystic fibrosis, hemochromatosis, age-related macular degen¬ 
eration, type 1 diabetes, type 2 diabetes, psoriasis, and breast, 
prostate, and colorectal cancer. The customer is informed of 
the results and is given some explanatory information to aid in 
interpreting them. In some cases, genetic counseling is avail¬ 
able. The cost of this procedure typically ranges from one 
hundred to several thousand dollars. 

This type of testing, sometimes termed "recreational genom¬ 
ics," has understandable appeal. Many people want to know 
more about their genomes and how DNA variation might affect 
their health. Many will likely present the results of these tests 
to their primary care physicians, expecting explanations and 
even predictions. Several Important considerations should be 
kept in mind. 


For most disease conditions, these tests have relatively low 
sensitivity and low specificity. A positive result seldom pre¬ 
dicts disease with precision (see Box 13-3), and a negative 
result should not Induce a false sense of security. For common 
multifactorial diseases, most of the responsible genetic vari¬ 
ants have not yet been identified, and, as discussed in Chapter 
12, nongenetic factors typically play a large role In causing the 
disease. The relative increase in disease risk associated with 
most variants is quite small, on the order of a few percentage 
points. These risk estimates are usually based on specific 
populations, typically Europeans or Americans of European 
descent; they might not apply accurately to members of other 
populations. There is considerable potential for misunder¬ 
standing of these results, and many of the concerns discussed 
in Box 13-3 (stigmatization, potential for loss of privacy) apply 
to dIrect-to-consumer testing. For these reasons, this type of 
genetic testing should be regarded with considerable caution. 


increasingly rapid and affordable. In disease-targeted 
sequencing, only the genes known to be associated with a 
specific disease are sequenced. For example, an individual 
with a positive family history of colorectal cancer could be 
tested for disease-causing variants in any of the dozen or so 
genes in which mutations have been shown to cause heredi¬ 
tary colorectal cancer (see Chapter 11). This approach has the 
advantage of detecting all pathogenic variants, including 


previously undiscovered ones, in the genes included in the 
testing panel. Disease-targeted sequencing panels have been 
developed for a number of hereditary conditions, including 
cardiomyopathies, cardiac arrhythmias (e.g., long QT syn¬ 
drome), muscular dystrophies, and many hereditary cancers 
(e.g., cancers of the breast, colon, ovaries, or pancreas). 

Disease-targeted sequencing is necessarily limited to the 
genes included in the sequencing panel. Whole-exome and 
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whole-genome sequencing (see Chapters 3 and 8) can over¬ 
come this limitation by detecting nearly all disease-causing 
variants in an individual genome. However, the enormous 
amount of data generated by exome and whole-genome 
sequencing presents additional challenges. As discussed in 
Chapter 8, each human exome contains hundreds of potential 
disease-causing candidate variants. Many of these are vari¬ 
ants of unknown significance whose effect on the phenotype 
is not yet known or understood. Large-scale efforts, involving 
sophisticated computer algorithms, ever-expanding data¬ 
bases of reported pathogenic variants, and functional studies 
of model organisms, are improving our ability to discrimi¬ 
nate disease-causing variants from harmless ones. 

A second challenge is that individuals who undergo exome 
or whole-genome sequencing for an indicated condition (e.g., 
a family history of colorectal cancer) are sometimes found to 
harbor a known pathogenic variant that causes another disease 
(e.g., an inherited cardiomyopathy). Currently, it is recom¬ 
mended to report these incidental findings to the patient if 
they are well validated and cause a condition that is considered 
to be clinically “actionable” (see the American College of 
Medical Genetics guidelines summarized by Green et al., 
2013). As the number of sequenced individuals grows, and as 
our understanding of the genetic basis of diseases increases, 
the list of actionable sequence variants is like to expand. 

New methods of direct mutation detection, 
including the use of microarrays and high- 
throughput sequencing, have greatly increased 
the speed and accuracy of genetic screening 
and diagnosis. Challenges raised by exome 
and whole-genome sequencing include large 
numbers of variants of unknown significance 
and issues regarding the reporting of inciden¬ 
tal findings. 


PRENATAL DIAGNOSIS OF GENETIC 
DISORDERS AND CONGENITAL DEFECTS 

Prenatal diagnosis is a major focus of genetic testing, and 
several important areas of technology have evolved to provide 
this service. The principal aim of prenatal diagnosis is to 
supply at-risk families with information so that they can 
make informed choices during pregnancy. The potential ben¬ 
efits of prenatal testing include providing reassurance to 
at-risk families when the result is normal; providing risk 
information to couples who, in the absence of such informa¬ 
tion, would not choose to begin a pregnancy; allowing a 
couple to prepare psychologically for the birth of an affected 
baby; helping the health-care professional to plan delivery, 
management, and care of the infant when a disease is diag¬ 
nosed in the fetus; and providing risk information to couples 
for whom pregnancy termination is an option. 

Given the controversy surrounding the issue of pregnancy 
termination, it should be emphasized that the great majority 
of prenatal diagnoses yield a normal test result. Thus, most 


families receive reassurance, and only a small minority face 
the issue of pregnancy termination. 

Both screening and diagnostic tests can be done prenatally. 
An example of a population screening test is the analysis of 
maternal serum at 15 weeks’ gestation for increased or 
decreased levels of a-fetoprotein (AFP) and several other 
serum components that can indicate an abnormal pregnancy. 
A positive test result identifies a subgroup for further testing 
for aneuploidy syndromes or neural tube defects (NTDs). A 
subsequent amniocentesis (the withdrawal of amniotic fluid 
during pregnancy) would represent a more accurate, specific 
diagnostic test. Prenatal diagnostic methods can be divided 
into two major types: analysis of fetal tissues (amniocentesis, 
chorionic villus sampling, cordocentesis, and preimplanta¬ 
tion genetic diagnosis) and visualization of the fetus (ultra¬ 
sonography, magnetic resonance imaging). Amniocentesis is 
considered an invasive method of prenatal diagnosis, in con¬ 
trast to noninvasive methods such as maternal serum screen¬ 
ing tests and fetal visualization. In this section, each of these 
procedures is described, and their accuracy, safety, and feasi¬ 
bility are discussed. 

Amniocentesis 

Amniocentesis is typically performed at 15 to 17 weeks after 
a pregnant woman’s last menstrual period (LMP). After real¬ 
time ultrasound imaging localizes the placenta and deter¬ 
mines the position of the fetus, a needle is inserted through 
the abdominal wall into the amniotic sac (Fig. 13-4). Between 
20 and 30 mL of amniotic fluid is withdrawn; this fluid con¬ 
tains living cells (amniocytes) shed by the fetus. The amnio- 
cytes are cultured to increase their number (a procedure that 
requires up to 7 days), and standard cytogenetic studies are 
carried out on the cultured amniocytes. In addition, cells can 
be grown for biochemical assays or DNA-based diagnosis of 
any genetic disease for which mutation testing is available. 
The results of cytogenetic studies are typically available in 10 
to 12 days. Because fluorescent in situ hybridization (FISH) 
can be carried out on a small number of uncultured amnio¬ 
cytes, it can provide an indication of fetal aneuploidy in just 
1 to 2 days. If the FISH result is positive, subsequent, confir¬ 
matory diagnosis by routine cytogenetic methods is recom¬ 
mended. Indications for prenatal diagnosis by amniocentesis 
are listed in Box 13-5. 

Amniocentesis is also used to measure a-fetoprotein 
(AFP), a fetal protein that is produced initially by the yolk sac 
and subsequently by the fetal liver. The AFP level normally 
increases in amniotic fluid until about 10 to 14 weeks’ gesta¬ 
tion and then decreases steadily. Amniotic fluid AFP is sig¬ 
nificantly higher in pregnancies in which the fetus has an 
NTD. When an amniotic fluid AFP assay is used with ultra¬ 
sonography (see later) in the second trimester, more than 
98% of fetuses with an open spina bifida and virtually all of 
those with anencephaly can be recognized. Among women 
who undergo amniocentesis for cytogenetic analysis, it is 
routine to also measure their amniotic fluid AFP level. 

In addition to a fetal NTD, there are several other causes 
of elevated (or apparently elevated) amniotic fluid AFP. These 
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FIG 13-4 A schematic illustration of an amniocentesis, in which 20 to 30 mL of amniotic fluid is 
withdrawn transabdominally (with ultrasound guidance), usually at 15 to 17 weeks' gestation. 



include underestimation of gestational age, fetal death, pres¬ 
ence of twins, blood contamination, and several specific mal¬ 
formations (e.g., omphalocele or gastroschisis, which are 
abdominal wall defects). Usually, targeted ultrasonography 
can distinguish among these alternatives. 

The safety and accuracy of amniocentesis have been estab¬ 
lished by several large collaborative studies. The risk of 
maternal complications is very low. Transient fluid leakage 
occurs in about 1% of mothers, and maternal infections are 
extremely rare. The risk of primary concern is fetal loss. 
Amniocentesis increases the risk of fetal loss by no more than 
0.2% to 0.3% (approximately 1/300 to 1/500) above the back¬ 
ground risk at 15 to 17 weeks post-LMP (i.e., if the risk of 


pregnancy loss after 17 weeks were 3% in mothers who did 
not have an amniocentesis, the risk would increase to 
3.2% to 3.3% in those who had the procedure). One must 
weigh the risk of fetal loss against the probability that 
the fetus is affected with a diagnosable condition (Clinical 
Commentary 13-4). 

Although amniocentesis provides highly accurate results, 
chromosomal mosaicism can lead to misdiagnosis. Most 
apparent mosaicism is caused by the generation of an extra 
chromosome during in vitro cell culture and is labeled as 
pseudomosaicism. This can be distinguished easily from true 
mosaicism if techniques are used in which all cells in a colony 
are the descendants of a single fetal cell. If only some cells in 
the colony have the extra chromosome, it is assumed that 
pseudomosaicism exists. If, however, consistent aneuploidy is 
visualized in all cells of multiple colonies, then true fetal 
mosaicism is diagnosed. Further confirmation of fetal mosa¬ 
icism (which is generally a rare condition) can be obtained 
by fetal blood sampling, as described later. 

Some centers have evaluated amniocentesis performed 
earlier in pregnancy, at about 12 to 14 weeks post-LMP. 
Because less amniotic fluid is present at this time, the risk of 
fetal loss or injury may be higher. A number of large-scale 
evaluations have indicated significantly higher rates of 
fetal loss for early amniocentesis, and some studies have 
shown increased rates of specific congenital anomalies (club 
foot in particular). 
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CLINICAL COMMENTARY 13-4 
The Amniocentesis Decision 

When a prenatal screening test (see text) identifies a risk of fetal 
abnormality greater than 1 in 500, it is common for a pregnant 
woman to consider the possibility of amniocentesis. Several 
factors enter into the decision-making process. First is quantita¬ 
tive risk estimate, determined by the screening result, for Down 
syndrome and other chromosomal disorders. A second factor 
is the risk of fetal loss from the procedure (about 0.2% to 0.3% 
above the background risk). A third issue is the expense of an 
amniocentesis with ultrasound and cytogenetic analysis, which 
ordinarily costs about $2000. These factors must be weighed 
in terms of their relative costs and benefits for the woman and 
her family. 

As this decision is explored in greater depth, other consid¬ 
erations often arise. If a woman has had previous miscar¬ 
riages, the additional risk of fetal loss may be weighed more 
heavily. In addition, the seriousness of bearing a child with 
disabilities may be perceived differently from family to family. 
Some couples are uncomfortable with the amount of time 
that elapses before test results are available (usually 10-12 
days). This discomfort should be acknowledged and vali¬ 
dated. The possibility of an ambiguous result (e.g., mosa¬ 
icism) also deserves discussion. Finally, it is important for the 
clinician to specify that an amniocentesis typically detects 
only a specific class of disorders (i.e., chromosome abnor¬ 
malities and neural tube defects) and not the entire range of 
birth defects and genetic disorders. 


Amniocentesis, the withdrawal of amniotic fluid 
during pregnancy, is performed at about 16 
weeks post-LMP and is used to diagnose many 
genetic diseases. The amniotic a-fetoprotein 
level is elevated when the fetus has a neural 
tube defect and provides a reliable prenatal test 
for this condition. The rate of fetal loss attribut¬ 
able to this procedure is approximately 1/300 to 
1/500 above the background risk level. Amnio¬ 
centesis can also be performed earlier in the 
pregnancy; some studies indicate an elevated 
rate of fetal loss after early amniocentesis. 

Chorionic Villus Sampling 

Chorionic villus sampling (CVS), the other commonly used 
invasive prenatal diagnostic technique, is performed by aspi¬ 
rating fetal trophoblastic tissue (chorionic villi) by either a 
transcervical or transabdominal approach (Fig. 13-5). Because 
it is usually performed at 10 to 11 weeks post-LMP, CVS has 
the advantage of providing a diagnosis much earlier in preg¬ 
nancy than second-trimester amniocentesis. This may be 
important for couples who consider pregnancy termination 
an option. 

Cell culture (as in amniocentesis) and direct analysis 
from rapidly dividing trophoblasts can provide material for 




FIG 13-5 A schematic illustration of a transcervical chorionic villus sampling procedure. With ultra¬ 
sound guidance, a catheter is inserted, and several milligrams of villus tissue are aspirated. 
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cytogenetic analysis. When chorionic villi are successfully 
obtained, CVS provides diagnostic results in more than 
99% of cases. Confined placental mosaicism (mosaicism in 
the placenta but not in the fetus itself) is seen in about 
1% to 2% of cases in which direct analysis of villus material 
is performed. This can confuse the diagnosis, because the 
mosaicism observed in placental (villus) material is not 
actually present in the fetus. This problem can usually be 
resolved by a follow-up amniocentesis. A disadvantage 
of CVS is that amniotic fluid AFP cannot be measured. 
Women who undergo CVS may have their serum AFP 
level measured at 15 to 16 weeks after LMP as a screen 
for NTDs. 

CVS, like amniocentesis, is generally a safe procedure. 
Several collaborative studies revealed a post-CVS fetal loss 
rate of approximately 1% above the background rate, com¬ 
pared with 0.2% to 0.3% above background for amniocente¬ 
sis. Factors that increase the risk of fetal loss include a lack of 
experience with the procedure and an increase in the number 
of transcervical passages used to obtain the villus sample. In 
experienced hands, transcervical and transabdominal proce¬ 
dures appear to entail similar risk levels. 

Some studies have indicated that CVS can increase the risk 
of limb deficiencies. Although other investigations have not 
corroborated this result, the apparent association has been of 
concern because the proposed mechanism (vascular insult 
leading to hypoperfusion of the limb) is biologically plausi¬ 
ble. The risk is greatest when CVS is performed earlier 
than 10 weeks post-LMP and decreases to no more than one 
in several thousand when the procedure is performed at 
10 to 11 weeks post-LMP. Accordingly, many professionals 
now recommend against performing CVS before 10 weeks 
after LMP. 

CVS is performed earlierthan amniocentesis (at 
10 to 11 weeks post-LMP), using either a trans¬ 
cervical or a transabdominal approach. The risk 
of fetal loss attributable to CVS is approximately 
1% to 1.5%. Confined placental mosaicism can 
confuse the diagnosis. There is some evidence 
that CVS can increase the risk of limb deficien¬ 
cies; this risk is greatest when the procedure is 
performed before 10 weeks post-LMP. 

Inborn errors of metabolism (see Chapter 7), which are 
usually autosomal or X-linked recessive diseases, can be diag¬ 
nosed prenatally by amniocentesis or CVS if the specific 
metabolic defect is expressed in amniocytes or trophoblastic 
tissue. They can also be diagnosed prenatally by DNA-based 
methods if the disease-causing mutation can be identified. 
Table 13-7 lists selected inborn errors of metabolism and 
single-gene disorders for which amniocentesis or CVS is 
available. 

Other Methods of Fetal Tissue Sampling 

Cordocentesis, or percutaneous umbilical blood sampling 
(PUBS), has become the preferred method to access fetal 


TABLE 13-7 Selected Inborn Errors 
of Metabolism That Are Diagnosable 
through Amniocentesis and/or Chorionic 
Villus Sampling 

1 DISEASE 

MEASURABLE ENZYME | 

Disorders of Amino Acid or Organic Acid Metabolism 

Maple syrup urine 

Branched-chain ketoacid 

disease 

decarboxylase 

Methylmalonic acidemia 

Methylmalonic coenzyme A 
mutase 

Multiple carboxylase 

Biotin responsive carboxylase 

deficiency 

Disorders of Carbohydrate Metabolism 

Glycogen storage 

a-Glucosidase 

disease, type 2 

Galactosemia 

Galactose-1-uridyl transferase 

Disorders of Lysosomal 

Enzymes 

Gangliosidosis (all types) 

|3-Galactosidase 

Mucopolysaccharidosis 

Disease-specific enzyme (see 

(all types) 

Chapter 7) 

Tay-Sachs disease 

Hexosaminidase A 

Disorders of Purine and 

Pyrimidine Metabolism 

Lesch-Nyhan syndrome 

Hypoxanthine-guanine 
phosphoribosyl transferase 

Disorders of Peroxisomal Metabolism 

1 Zellweger syndrome 

Long-chain fatty acids | 


blood. PUBS is usually carried out after the 16th week of 
gestation and is accomplished by ultrasound-guided punc¬ 
ture of the umbilical cord and withdrawal of fetal blood. The 
fetal loss rate attributable to PUBS is low, but it is slightly 
higher than that of amniocentesis or CVS. 

There are three primary applications of PUBS. It is used 
for cytogenetic analysis of fetuses with structural anomalies 
detected by ultrasound when rapid diagnosis is required. 
Cytogenetic analysis from fetal blood sampling is completed 
in 2 to 3 days, whereas diagnosis after amniocentesis can 
require 10 to 12 days if amniocytes must be cultured. This 
time difference can be critical in the later stages of a preg¬ 
nancy. A second application is diagnosis of hematological 
diseases that are analyzed most effectively in blood samples 
or diagnosis of immunologic disorders such as chronic gran¬ 
ulomatous disease (see Chapter 9). PUBS is also used to make 
a rapid distinction between true fetal mosaicism and false 
mosaicism caused by maternal contamination of an amniotic 
fluid sample. 

Percutaneous umbilical blood sampling (PUBS, 
or cordocentesis) is a method of direct sam¬ 
pling of fetal blood and is used to obtain a 
sample for rapid cytogenetic or hematological 
analysis or for confirmation of mosaicism. 
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Ultrasonography 

Technological advances in real-time ultrasonography have 
made this an important tool in prenatal diagnosis. A trans¬ 
ducer placed on the mother’s abdomen sends pulsed sound 
waves through the fetus. The fetal tissue reflects the waves in 
patterns corresponding to tissue density. The reflected waves 
are displayed on a monitor, which allows real-time visualiza¬ 
tion of the fetus. Ultrasonography can help to detect many fetal 
malformations, and it enhances the effectiveness of amniocen¬ 
tesis, CVS, and PUBS. Box 13-6 lists some of the congenital 
malformations that are diagnosable by fetal ultrasound. 

Ultrasonography is sometimes used to test for a specific 
condition in an at-risk fetus (e.g., a short-limb skeletal dys¬ 
plasia). More often, fetal anomalies are detected during the 
evaluation of obstetrical indicators such as uncertain gesta¬ 
tional age, poor fetal growth, or amniotic fluid abnormalities. 
Second-trimester ultrasound screening has become routine 
in developed countries. Studies of ultrasound screening 
suggest that sensitivity for the detection of most major con¬ 
genital malformations ranges from 30% to 50%. Specificity, 
however, approaches 99%. 

The sensitivity of ultrasonography is higher for some con¬ 
genital malformations. In particular, ultrasound can detect 


BOX 1 3-6 Selected Disorders Diagnosed 
by Ultrasound in the Second Trimester 


Symptom Complex 

Hydrops 

Oligohydramnios 

Polyhydramnios 

Intrauterine growth retardation 

Central Nervous System 

Anencephaly 

Encephalocele 

Holoprosencephaly 

Hydrocephalus 

Chest 

Congenital heart disease 
Diaphragmatic hernia 

Abdomen, Pelvis 

Gastrointestinal atresias 

Gastroschisis 

Omphalocele 

Renal agenesis 

Cystic kidneys 

Hydronephrosis 

Skeletal System 

Limb deficiencies 

Many chondrodystrophies, including thanatophoric dysplasia 
and osteogenesis imperfecta 

Craniofacial 

Cleft lip 

*Detection rate varies by condition. 


virtually all fetuses with anencephaly and 85% to 90% of 
those with spina bifida (Fig. 13-6). It also sometimes identi¬ 
fies a fetus with a chromosome abnormality by detecting a 
congenital malformation, intrauterine growth retardation, 
hydrops (abnormal accumulation of fluid in the fetus), or an 
alteration of the amniotic fluid volume. 

Ultrasonography is the technique used most commonly for 
fetal visualization, but other techniques are also used. Radiog¬ 
raphy is still used occasionally, for example, to evaluate a fetus 
for skeletal defects. Magnetic resonance imaging (MRI) offers 
much greater resolution than ultrasonography and is becom¬ 
ing more widely available for prenatal screening. 

Prenatal diagnosis includes invasive techniques 
designed to analyze fetal tissue (CVS, amnio¬ 
centesis, PUBS) and noninvasive procedures 
that visualize the fetus (ultrasonography, MRI). 

Maternal Serum Screening in the First 
and Second Trimesters 

Soon after the link between elevated amniotic fluid AFP and 
NTDs was recognized, an association between increased 



FIG 13-6 A, Photograph of an ultrasound result, revealing a 
fetus with a normal spinal column. B, Ultrasound result for a 
fetus with a meningomyelocele, visible as fluid-filled sacs 
(arrows) located toward the base of the spinal column. 
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levels of maternal serum AFP (MSAFP) and NTDs was iden¬ 
tified. AFP diffuses across the fetal membranes into the moth¬ 
er’s serum, so MSAFP levels are correlated with amniotic 
fluid AFP levels. Thus, it is possible to measure amniotic fluid 
AFP noninvasively by obtaining a maternal blood sample at 
15 to 17 weeks post-LMP. 

Because 90% to 95% of NTD births occur in the absence 
of a family history of the condition, a safe, noninvasive popu¬ 
lation screening procedure for NTDs is highly desirable. 
However, there is considerable overlap of MSAFP levels in 
women carrying a fetus with an NTD and those carrying an 
unaffected fetus (Fig. 13-7). Thus, the issues of sensitivity and 
specificity must be considered. Typically, an MSAFP level is 
considered to be elevated if it is 2 to 2.5 times higher than the 
normal median level (adjustments for maternal weight, pres¬ 
ence of diabetes mellitus, and ancestry are included in these 
calculations). Approximately 1% to 2% of pregnant women 
exhibit MSAFP levels above this cutoff level. After adjusting 
for advanced gestational age, fetal demise, and presence of 
twins, about 1 in 15 of these women has an elevated amniotic 
fluid AFP. The positive predictive value of the MSAFP screen¬ 
ing test is thus rather low, approximately 6% (1 /15). However, 
the sensitivity of the test is fairly high: MSAFP screening 
identifies approximately 90% of anencephaly cases and about 
80% of open spina bifida cases. Although this sensitivity level 
is lower than that of amniotic fluid AFP testing, MSAFP 
measurement poses no risk of fetal loss and serves as an effec¬ 
tive screening measure. Women who have an elevated MSAFP 
may choose to undergo diagnostic amniocentesis to deter¬ 
mine whether they are in fact carrying a fetus with an NTD. 

An association was found in the 1990s between low MSAFP 
and the presence of a fetus with Down syndrome. Previously, 
population screening for Down syndrome consisted of 
amniocentesis for women older than 35 years. Although 
highly accurate, this screening strategy has a sensitivity of 
only 20%: because the great majority of births occur in 
women younger than 35 years, only about 20% of all babies 



Maternal serum a-fetoprotein (multiples of median) 

FIG 13-7 Maternal serum a-fetoprotein (MSAFP) levels in 
mothers carrying normal fetuses and in mothers carrying 
fetuses with Down syndrome and open spina bifida. MSAFP 
is somewhat lowered when the fetus has Down syndrome, 
and it is substantially elevated when the fetus has an open 
spina bifida. (From Milunsky A. Genetic Disorders and the 
Fetus: Diagnosis, Prevention, and Treatment, 4th ed. Balti¬ 
more: Johns Flopkins University Press; 1998.) 


with trisomy 21 are born to mothers older than age 35. 
MSAFP measurement has expanded the option of population 
screening for Down syndrome. 

MSAFP levels overlap considerably in normal and Down 
syndrome pregnancies. The risk for Down syndrome in women 
younger than 35 years increases by a factor of 3 to 4 when the 
adjusted MSAFP value is lower than 0.5 multiples of the 
normal population median (see Fig. 13-7). In deriving a risk 
estimate, complex formulas take into account the mother’s 
weight, age, and MSAFP level. A woman who is 25 years of age 
ordinarily has a risk of about 1/1250 for producing a fetus with 
Down syndrome, but if she has a weight-adjusted MSAFP of 
0.35 multiples of the median, her risk increases to 1/171. This 
risk is higher than that of a 35-year-old woman in the general 
population. Most screening programs use a risk factor of 1/380 
(equivalent to the average risk for a 35-year-old woman to 
produce a newborn with Down syndrome) as an indication for 
subsequent diagnostic evaluation by amniocentesis. 

The accuracy of Down syndrome screening can be 
increased by measuring the serum levels of unconjugated 
estriol, human chorionic gonadotropin, and inhibin-A in 
addition to MSAFP (the quadruple screen). Although 
MSAFP alone identifies only about 40% of Down syndrome 
pregnancies, the four indicators together can identify approx¬ 
imately 80% (with a false-positive rate of 5%). The quadruple 
screen can also detect most cases of trisomy 18. 

First-trimester maternal serum screening (at 10 to 13 
weeks) for Down syndrome is being used increasingly in the 
United States and Europe. Three of the most useful measure¬ 
ments are the free [3 subunit of human chorionic gonadotro¬ 
pin (F(3hCG), pregnancy-associated plasma protein A 
(PAPP-A), and an ultrasound assessment of nuchal translu- 
cency (NT, the abnormal accumulation of fluid behind the 
neck of a fetus). Measurement of these three quantities in the 
first trimester enables detection of 80% to 85% of Down 
syndrome cases (with a false-positive rate of 5%, or 95% 
specificity). Combining first- and second-trimester screen¬ 
ing, called integrated screening, increases the sensitivity of 
Down syndrome detection to approximately 90%, with 95% 
specificity. Measurement of F(3hCG and PAPP-A is also useful 
for detection of trisomy 13 and trisomy 18 in the first trimes¬ 
ter. These screening results may be combined with CVS or 
amniocentesis to provide a more precise diagnostic test. 

MSAFP provides a screening approach that 
increases the prenatal detection of fetuses 
with various abnormalities, including NTDs, 
trisomy 18, and Down syndrome. This nonin¬ 
vasive procedure entails virtually no risk, but 
its sensitivity and specificity for detecting 
NTDs are lower than those of amniotic AFP 
diagnosis. Use of additional markers (e.g., the 
quadruple screen) in the second trimester 
increases sensitivity for detecting Down syn¬ 
drome. Screening of maternal serum for Down 
syndrome, trisomy 13, and trisomy 18 can be 
carried out in the first trimester. 
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Analysis of Fetal DNA in Maternal Circulation 

The ability to isolate fetal cells and DNA from maternal blood 
during pregnancy has opened up new opportunities for pre¬ 
natal screening and diagnosis. This approach, called nonin- 
vasive prenatal screening (NIPS), has become widely used in 
recent years, and some authorities think it will replace the 
current first- and second-trimester screening approaches. 
During pregnancy, a small number of fetal cells cross the 
placental barrier to enter the mother’s circulation as early as 
6 to 8 weeks post-LMP. These cells can be analyzed using 
techniques such as FISH or PCR, but isolating fetal cells spe¬ 
cifically in maternal blood has been a challenge. 

Recently, researchers have focused on fragments of cell- 
free DNA (cfDNA) released into the maternal circulation 
from trophoblasts that undergo programmed cell death. This 
cfDNA represents the fetal genome and composes a signifi¬ 
cant proportion (5% to 50%) of all cfDNA found in maternal 
plasma. This proportion tends to increase during pregnancy, 
and at approximately 9-10 weeks post-LMP a small quantity 
of maternal blood (10 mL) can be sampled to obtain cfDNA. 
Fetal cfDNA is differentiated from maternal cfDNA by iden¬ 
tifying SNP haplotypes that occur in excess frequency in the 
maternal circulation. cfDNA has been used to identify the sex 
of the fetus and its Rh blood type (especially important if the 
mother is Rh-negative and the fetus may be Rh-positive; see 
Chapter 9). High-throughput sequencing techniques can be 
used with fetal cfDNA, and full fetal genome sequences have 
now been obtained. Sequencing read counts (see Chapter 8) 
can reveal aneuploidy (e.g., trisomy 21 would produce a 50% 
excess in reads from chromosome 21). Recent studies indicate 
that maternal plasma cell-free DNA has at least several-fold 
higher specificity and positive predictive value for detection 
of trisomies 13,18, and 21 than does standard screening. The 
sensitivity of the test for aneuploidy detection is virtually 
100%. It is important to stress that NIPS is a screening test, 
not a diagnostic test, so positive test results are confirmed by 
CVS or amniocentesis. By decreasing the number of false 
positive tests, current NIPS techniques can reduce the need 
for invasive tests such as amniocentesis and CVS. 

Cell-free DNA (cfDNA) that enters the maternal 
circulation can be evaluated for mutations 
or aneuploidy using high-throughput sequenc¬ 
ing. This procedure entails no risk of fetal 
loss and has greater specificity and positive 
predictive value for aneuploidy detection than 
other first- and second-trimester screening 
procedures. 

Preimplantation Genetic Diagnosis 

The most common type of preimplantation genetic diag¬ 
nosis (PGD) is carried out on a blastomere obtained in the 
course of in vitro fertilization. Diagnosis is begun 3 days 
after fertilization, when the embryo contains six or eight 
cells. One or two blastomeres can safely be removed from 


the embryo. FISH analysis of the cell (see Chapter 6) can be 
used to diagnose aneuploidy. Also, the cell’s DNA can be 
amplified using PCR, permitting the diagnosis of single-gene 
diseases. If the embryo from which the blastomere was 
obtained is morphologically normal and neither the disease- 
causing mutation nor aneuploidy is detected, the embryo is 
implanted into the mother’s uterus. Testing protocols have 
been developed for dozens of genetic diseases (e.g., cystic 
fibrosis, Tay-Sachs disease, (J-thalassemia, myotonic dystro¬ 
phy, Huntington disease, Duchenne muscular dystrophy), 
and thousands of normal babies have been born after blasto¬ 
mere diagnosis. 

An occasional problem with blastomere PGD is that one 
of the two alleles of a locus may be undetectable, which could 
cause a heterozygote to appear as a homozygote. This phe¬ 
nomenon, termed “allelic dropout,” occurs because of partial 
PCR amplification failure when using DNA from only a 
single cell. This has led to misdiagnosis in a small number of 
cases, and several methods are used to increase accuracy. For 
example, highly heterozygous STRs closely linked to the 
disease-causing locus can be tested as part of the PCR analy¬ 
sis. If only one of the parents’ STR alleles can be observed in 
the blastomere’s DNA, it is likely that allelic dropout has also 
occurred for the disease-causing locus. The testing of two 
cells, rather than one, helps to avoid allelic dropout. 

PGD can also be carried out at the 100-cell blastocyst 
stage, using cells from the trophoectoderm of the blastocyst. 
This procedure has the advantage that a larger collection of 
cells is analyzed, helping to avoid allelic dropout. A disadvan¬ 
tage is that extraembryonic tissue (the trophoectoderm) is 
diagnosed, rather than the embryo itself. 

Polar body diagnosis involves an examination of the first 
or second polar body produced along with the ovum (see 
Chapter 2). The polar body’s DNA is tested to determine 
whether it contains a disease-causing mutation. If it does, it 
is assumed that the egg does not contain the mutation. This 
egg is then fertilized and implanted using the usual in vitro 
techniques. Because only the polar body is examined, pater¬ 
nal mutations cannot be evaluated. Polar body diagnosis is 
thus most useful when only the mother is at risk for transmit¬ 
ting a disease-causing mutation or when testing for aneu¬ 
ploidy (because most aneuploidies are contributed by the 
mother [see Chapter 6]). 

PGD is most commonly used by couples who have resorted 
to in vitro fertilization and wish to test for diagnosable genetic 
conditions. It can also be useful for couples who want prena¬ 
tal diagnosis but would not consider a pregnancy termina¬ 
tion. PGD, however, is costly and technically challenging, and 
its availability is still limited. 

Preimplantation genetic diagnosis can be 
carried out on polar bodies, blastomeres, or 
blastocyst cells, on which PCR analysis and/or 
FISH is performed. Diagnosis of genetic condi¬ 
tions permits implantation of only unaffected 
embryos and avoids the issue of pregnancy 
termination. 
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FETAL TREATMENT 

A potential goal of prenatal diagnosis is treatment of the 
affected fetus. Although this is not currently possible for most 
conditions, some examples can be cited. Many of these pro¬ 
cedures are experimental. 

Two of the best-established forms of in utero intervention 
are treatment for rare inborn errors of metabolism and treat¬ 
ment for hormone deficiencies. An important example of a 
treatable biochemical disorder is biotin-responsive multiple 
carboxylase deficiency, an autosomal recessive condition that 
can be diagnosed by amniocentesis. In one case report, oral 
administration of biotin to the mother was initiated at 23 
weeks of pregnancy and resulted in the birth of a normal baby. 

CAH is a second example of a condition for which in utero 
treatment has been successful after prenatal diagnosis. 
Because of excessive androgen secretion by the enlarged fetal 
adrenal glands, female fetuses with CAH become masculin¬ 
ized. Administration of dexamethasone to the mother begin¬ 
ning at 10 weeks post-LMP diminishes or prevents this 
masculinization. 

Surgical treatment of fetuses, primarily for conditions 
involving urinary tract obstruction, has met with moderate 
success. Surgical correction of diaphragmatic hernia at 20 
weeks’ gestation has also been attempted, but results have 
been discouraging and this approach has been abandoned. 
Clinical trials have shown that surgical closure of myelome¬ 
ningocele (spina bifida) at approximately 26 weeks gestation 
helps to restore the normal flow of cerebrospinal fluid and 
can reduce hydrocephalus. However, the procedure also 
increases the likelihood of preterm birth and maternal com¬ 
plications. Some success has been achieved in transplantation 
of hematopoietic stem cells into fetuses with X-linked severe 
combined immune deficiency (see Chapter 9). 

GENE THERAPY 

As we have seen, gene discovery provides opportunities for 
improved understanding and diagnosis of many diseases. The 
identification of these genes also leads to the possibility of 
genetic alteration of the cells of affected persons (gene 
therapy). Although gene therapy is still in its infancy and has 
only begun to affect the lives of patients, its potential for 
curing genetic diseases has excited a great deal of interest in 
both professional and lay circles. Approximately 2000 gene 
therapy protocols involving more than 100 different genes 
have been approved for clinical trials (see Table 13-8 for some 
examples). In this section we review gene therapy techniques 
and discuss their application in the treatment of disease. 

Somatic Cell Therapy 

Somatic cell gene therapy, which has been the focus of gene 
therapy research in humans, consists of the alteration of 
genes in human somatic cells to treat a specific disorder. The 
patient’s cells are extracted and manipulated outside the body 
{ex vivo therapy), or in some cases the cells are treated while 
they are in the body {in vivo therapy). 


Some types of somatic cells are more amenable to gene 
therapy than others. Good candidates should be easily acces¬ 
sible and should have a long life span in the body. Proliferat¬ 
ing cells are preferred for some gene-delivery systems, 
because the vector carrying the gene can then integrate into 
the cells’ replicating DNA. The bone marrow stem cell meets 
all of these qualifications and thus has been a prime candi¬ 
date for somatic therapy. Although these cells are difficult to 
manipulate and to isolate from bone marrow (the great 
majority of bone marrow cells are not stem cells), they have 
now been successfully isolated and genetically altered in 
many gene therapy treatments. Other cell types have also 
been investigated as potential targets, including skin fibro¬ 
blasts, muscle cells, vascular endothelial cells, hepatocytes, 
and lymphocytes. A disadvantage of using such cells is that 
their life span may be relatively short. Thus, therapy using 
them can require repeated treatment and administration of 
genetically altered cells. 

Gene Replacement Therapy 

Most current gene therapy techniques involve replacing a 
missing gene product by inserting a normal gene into somatic 
cells. This approach is best suited to correcting loss-of-func- 
tion mutations that result in a nonfunctional or missing gene 
product; inserting the normal gene supplies the missing 
product. Even a partially effective gene therapy strategy, pro¬ 
ducing perhaps 5% to 20% of the normal amount of the gene 
product, can provide significant health benefits. 

There are many techniques for introducing genes into 
cells, but viruses, having naturally evolved clever strategies for 
inserting their genetic material into cells, are the most com¬ 
monly used gene therapy vectors. In the following paragraphs 
viral vectors are discussed first, followed by a discussion of 
some potentially effective nonviral delivery systems. 

Retroviral Vectors 

Retroviruses, a form of RNA virus, can insert copies of their 
genomes into the nuclei of host cells after reverse- 
transcribing their viral RNA into double-stranded DNA (see 
Chapter 11). The insertion of foreign DNA into a host cell 
via a viral vector is termed transduction. Retroviruses trans¬ 
duce host cells with a high degree of efficiency, and they 
seldom provoke immune responses, making them a reason¬ 
able choice as a gene-delivery vector (Fig. 13-8). Recombi¬ 
nant DNA techniques are used to create replication-defective 
retroviruses in which the three retroviral protein-coding 
genes are replaced with a normal copy of a human gene and 
a promoter element (the “insert,” which can be as large as 
8-12 kb in a retrovirus). The modified retroviruses are then 
incubated with the patient’s somatic cells (e.g., bone marrow 
stem cells, lymphocytes) so that the retrovirus transduces 
the normal human gene into the DNA of the host cells. 
Ideally, the inserted gene will then encode a normal gene 
product in the patient’s somatic cells. This type of protocol 
has been used experimentally with many diseases, including 
forms of severe combined immunodeficiency (Clinical 
Commentary 13-5). 
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FIG 13-8 Gene therapy using a retroviral vector. The retrovi¬ 
rus is prevented from replicating by removal of most of its 
genome, and a normal human gene is inserted into the ret¬ 
rovirus. Incubation with human somatic cells allows the ret¬ 
rovirus to insert copies of the normal human gene into the 
cell. Once integrated into the cell's DNA, the inserted gene 
produces a normal gene product. 


Although retroviruses offer the advantages of stable 
and efficient integration into the genome, they also present 
specific disadvantages. Because retroviruses integrate prefer¬ 
entially near promoter sequences, a retroviral vector can 
insert near a proto-oncogene, activating it and thus causing 
tumor formation. Most types of retrovirus can enter the 
nucleus only when its membrane dissolves during cell divi¬ 
sion, so they can transduce only dividing cells and are inef¬ 
fective in nondividing or slowly dividing cells (e.g., neurons). 
Although this attribute is typically a disadvantage, it can be 
useful when the goal of therapy is to target only dividing cells 
and to avoid nondividing cells (e.g., in the treatment of a 
brain tumor, where tumor cells are dividing but nearby 
healthy neurons are not). 


Adenoviral Vectors 

Because of the inability of most retroviruses to transduce 
nondividing cells, other delivery systems have been explored 
that are not limited in this way. An important example is the 
adenovirus, a double-stranded DNA virus that is often used 
in vaccine preparations. In addition to its ability to transduce 
nondividing cells, the adenoviral vector can now be designed 
to accept inserts of approximately 36 kb in size. Adenoviruses 
do not integrate into the host cell’s DNA, which provides the 
advantage that they will not activate a proto-oncogene or 
otherwise disturb the genome. However, lack of integration 
is also a disadvantage because adenoviruses are eventually 
inactivated. This often results in transient gene expression 
(though long-term expression is sometimes achieved) and 
can require readministration of the vector. Because only part 
of the adenovirus genome is typically removed, the vector 
often provokes an immune response (e.g., inflammatory 
responses in the airways of cystic fibrosis patients in whom 
adenoviruses were used to introduce normal copies of the 
CFTR gene into airway epithelial cells). This problem increases 
with repeated introduction of the adenovirus, which stimu¬ 
lates further immune response to the foreign protein. Current 
research is focusing on “gutless” adenoviruses, in which nearly 
all of the viral genome is removed to reduce the immune 
response and increase the potential size of the insert. 

Adeno-Associated Viral Vectors 

Adeno-associated viruses (AAVs) are a type of parvovirus 
that requires the presence of adenoviruses for their normal 
replication (hence the term adeno-associated). Like adenovi¬ 
ruses, AAVs are DNA viruses that can transduce nondividing 
cells. In addition, they elicit much less immune response than 
do adenoviruses and have little, if any, pathogenic effect. They 
are also capable of sustaining protracted therapeutic expres¬ 
sion (months to years). These vectors, however, can accept a 
DNA insert of only about 4.5 kb. (In some cases, this problem 
can be circumvented by dividing the insert into two parts, 
placing each part in a vector, and designing the mRNA prod¬ 
ucts to reassemble.) Because of their many useful properties, 
AAVs have become much more popular as a gene therapy 
vector during the past several years. They have been tested in 
clinical trials for the treatment of cystic fibrosis, hemophilia B, 
tti-antitrypsin deficiency, Duchenne muscular dystrophy, Par¬ 
kinson disease, Alzheimer disease, and many other disorders. 

Lentiviral Vectors 

Lentiviral viruses are complex RNA retroviruses that, unlike 
simple retroviruses, can transduce nondividing cells through 
pores in the nuclear membrane (human immunodeficiency 
virus [HIV] is an example of a lentivirus). Like other retro¬ 
viruses, lentiviruses can integrate stably into the genome, and 
they can accept reasonably large inserts (8 kb). There is no 
evidence that lentiviruses integrate preferentially near genes, 
as most retroviruses do. Because they combine the desirable 
properties of greater safety, stable integration, and the ability 
to transduce nondividing cells, lentiviruses are currently the 
focus of much research and development. 
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CLINICAL COMMENTARY 13-5 

Gene Therapy and Severe Combined Immunodeficiency 

Gene therapy has been attempted for several forms of severe 
combined immunodeficiency (SCID), including adenosine 
deaminase deficiency SCID (ADA-SCID) and X-linked SCID. 
ADA, which is produced primarily in lymphoid tissues, is an 
important component of the purine salvage pathway. ADA 
deficiency, an autosomal recessive disorder that accounts for 
about 15% of SCID cases, results in the abnormal accumula¬ 
tion of purine metabolites toxic to T lymphocytes. Subse¬ 
quently, B lymphocytes are also reduced in function and 
number. The resulting SCID is usually fatal by the age of 2 
years if untreated. 

The preferred treatment for ADA-SCID is bone marrow trans¬ 
plantation. However, complications of bone marrow transplan¬ 
tation increase patient morbidity and are sometimes fatal. 
In addition, major histocompatibility complex (MHC)-compati- 
ble sibling donors are available for less than 30% of ADA- 
SCID patients. Patients may be treated with polyethylene 
glycol (PEG)-conjugated ADA (administered once or twice 
weekly by intramuscular injection), but the response to this 
treatment is variable, and some patients develop antibodies 
against PEG-ADA. 

Because it is a systemic disorder caused by an enzyme defi¬ 
ciency, ADA-SCID represents a good candidate for gene 
replacement therapy. Ideally, proliferating bone marrow stem 
cells would be modified by retroviral vectors containing the 
normal ADA gene, resulting in a permanent cure for the disor¬ 
der. Because of difficulties in dealing with bone marrow stem 
cells, gene therapy for ADA-SCID was initiated instead in 1990 
by retroviral insertion of ADA genes into lymphocytes that 
were extracted from patients. After retroviral insertion, the 
lymphocytes were injected back into the patients' peripheral 
circulation. This was the first application of gene therapy to an 
inherited human disorder. More recently, ADA-SCID has been 
treated by retroviral insertion of the ADA gene into bone 
marrow stem cells, rather than lymphocytes. This treatment 
has produced long-term increases in B-cell and T-cell counts 
and normal immune function in treated patients. 

X-linked SCID results from mutations in the gene, SCIDXl, 
that encodes subunits of the j chain found in six different 
cytokine receptors (those of interleukins 2, 4, 7, 9, 15, and 21; 


see Chapter 9). Lacking these receptors, T cells and natural 
killer cells cannot receive the signals they need for normal 
maturation. The T-cell deficiency in turn produces a deficiency 
of normal B cells, resulting in SCID. As with ADA deficiency, 
this disorder can be treated with bone marrow transplantation 
if an MHC-compatible donor is available. Without a bone 
marrow transplant, the disease is fatal early in childhood. 

In 1999, retroviral therapy was initiated to introduce SCIDXl 
into patients' bone marrow stem cells. Fewer than 1 % of bone 
marrow stem cells were effectively transduced with the thera¬ 
peutic gene. However, the transduced cells enjoyed a selective 
growth advantage over other bone marrow stem cells because 
the inserted gene increased the cytokine signaling needed for 
normal cell function. In most treated patients, the number of 
natural killer cells, T cells, and B cells increased to near-normal 
levels, with sustained resistance to infections continuing for 
more than a decade after therapy. 

These positive outcomes in most ADA and X-linked SCID 
patients have been widely heralded as the first successful uses 
of somatic cell gene therapy in the treatment of an inherited 
disease. However, five X-linked SCID patients developed 
leukemia-like disease (clonal T-cell proliferation) as a result of 
random insertion of the retroviral vector in or near LM02, a 
proto-oncogene that is activated in about half of all cases of 
acute lymphocytic leukemia. This was fatal in one patient, but 
the others were treated successfully by chemotherapy and 
continued to benefit from gene therapy. Leukemia has also 
occurred in patients with chronic granulomatous disease and 
Wiskott-Aldrich syndrome (Table 13-8) who have been treated 
with retroviral therapy. It is likely that the promoter and enhancer 
sequences in the retroviral vector up-regulate the LM02 gene. 
Self-inactivating retroviral vectors, in which enhancer sequences 
are deleted, are now being tested in clinical trials. 

These examples illustrate some of the promise, as well as 
some of the perils, of somatic cell gene therapy. Clearly, gene 
therapy poses hazards that must be closely monitored. 
However, these protocols can lead to effective treatment of 
otherwise lethal diseases, and they have provided invaluable 
information for the development of gene therapy protocols for 
other genetic diseases. 


TABLE 13-8 Examples of Diseases in Which Somatic Cell Gene Therapy Has Had 
Significant Therapeutic Effect 

DISEASE 

TARGET CELL 

PRODUCT OF INSERTED OR MODIFIED GENE 

Severe combined 

Circulating lymphocytes. 

Adenosine deaminase 

immunodeficiency (SCID) due to 

bone marrow stem cells 


adenosine deaminase deficiency 

x-linked SCID 

Bone marrow stem cells 

y subunit of interleukin receptors 

Hemophilia B 

Hepatocytes, skin fibroblasts 

Factor IX 

Duchenne muscular dystrophy 

Myocytes 

Dystrophin (modified by exon skipping) 

Wiskott-Aldrich syndrome 

Hematopoietic stem cells 

Wiskott-Aldrich Syndrome Protein (WASP) 

(primary immunodeficiency) 

X-linked adrenoleukodystrophy 

Hematopoietic stem cells 

Adenosine triphosphate-binding cassette transporter 

Leber congenital amaurosis 

Rod photoreceptor cells 

Retinal pigment epithelium-specific protein 65 

(childhood blindness) 

P-Thalassemia 

Hematopoietic stem cells 

P-Globin 

Lipoprotein lipase deficiency 

Myocytes 

Lipoprotein lipase 













285 


CHAPTER 13 Genetic Testing and Gene Therapy 


Challenges in Viral Gene Therapy 

Although viral gene therapy holds considerable promise, 
there are several important challenges: 

• Transient and low-level expression. The gene product may 
be expressed at sub therapeutic levels, often less than 1% 
of the normal amount. In part, this reflects the fact that 
only some of the target cells successfully incorporate the 
normal gene. In addition, random insertion of the virus 
into the host’s genome can affect gene regulation (e.g., 
some of the regulatory sequences required for normal 
expression levels are not present). Cells sometimes respond 
to foreign inserted DNA by methylating—and thus 
inactivating—it. For these reasons, transcription of the 
gene often ceases after a few weeks or months. It should 
be noted, however, that transient expression is sufficient, 
and even desirable, for some types of therapy, such as 
provoking an immune response against a tumor or gener¬ 
ating new blood vessels (discussed later). 

• Difficulties in reaching or specifying target tissue. Although 
some systemic disorders are relatively easy to target by 
modifying lymphocytes or bone marrow stem cells, others 
present formidable challenges. It may be difficult, for 
example, to target affected neurons responsible for central 
nervous system disorders. In addition, vectors must be 
modified so that they enter only the desired cell type. 

• Necessity for precise regulation of gene activity. Accurate 
regulation of gene activity is not a concern for some dis¬ 
eases (e.g., a 50-fold overexpression of adenosine deami¬ 
nase has no clinically significant effects). However, it is 
critical for diseases such as thalassemia, in which the 
number of a-globin and ji-globin chains must be closely 
balanced (see Chapter 3). ft is often difficult to achieve 
such precision using viral gene therapy. 

• Potential for insertional mutagenesis. The unpredictable 
integration of a retroviral vector into the host’s DNA can 
have undesired consequences, as discussed earlier. 
Although insertional mutagenesis appears to be a rare 
event, it has occurred in several patients (see Clinical 
Commentary 13-5). 

Considerable research is being devoted to overcoming 
these and other problems. For example, the levels and per¬ 
manence of gene expression are being increased by incorpo¬ 
rating stronger promoter sequences in DNA inserts. Vectors 
are being modified to reduce immune responses and to 
increase target-cell specificity. Methods are being developed 
for targeted insertion of corrected DNA sequences. For 
example, proteins are engineered to bind to a specific mutated 
DNA sequence and to induce double-stranded DNA breaks 
followed by the insertion of a normal DNA sequence. The 
CRISPR/Cas9 gene editing method (see Chapter 10) uses 
specific noncoding RNAs to guide the DNA-cutting Cas9 
nuclease to a specific location in the genome, where DNA 
bases are then removed or altered. With targeted insertion, 
the mutated DNA is corrected in situ, avoiding difficulties 
with random insertion of DNA and taking advantage of the 
native promoter and enhancer sequences in the host’s genome. 


Viral vectors offer highly efficient transfer of 
therapeutic genes into somatic cells. However, 
they have several drawbacks, including low or 
transient expression of the gene product, 
limited insert size, generation of immune 
responses, difficulty in precise regulation, and, 
for some vectors, a lack of ability to trans¬ 
duce nondividing cells and the potential for 
oncogenesis. 

Nonviral Vectors 

Although viral vectors provide the advantage of efficient gene 
transfer into cells, the disadvantages just listed have prompted 
researchers to investigate several types of nonviral vectors. 
One of the most extensively studied is the liposome, a fat 
body that can accept large DNA inserts. Liposomes some¬ 
times fuse with cells, allowing the DNA insert to enter the 
cell. Because the liposome has no peptides, it does not elicit 
an immune response. Its primary disadvantage is that it lacks 
the transfer efficiency of viruses: most of the liposomes are 
degraded in the cytoplasm, and most of those that are not 
degraded are unable to enter the nucleus. 

Surprisingly, it is possible to insert plasmids containing 
human DNA directly into cells without using any delivery 
vector at all. Although most “naked” DNA is repelled by the 
cell membrane, the DNA occasionally enters the cell, escapes 
degradation, and temporarily encodes proteins. Attempts are 
under way to use naked DNA as a vaccine that encodes a 
pathogenic protein against which the body mounts an 
immune response. 

An intriguing development with potential for somatic cell 
therapy is the synthesis of human artificial chromosomes. 
Because these synthetically constructed chromosomes contain 
functional centromeres and telomeres, they should be able to 
integrate and replicate in human cell nuclei. Also, they are 
capable of accepting inserts as large as the entire 2.4 Mb 
Duchenne muscular dystrophy gene (DMD). 

Gene therapy using nonviral vectors, including 
liposomes and naked DNA, offers some advan¬ 
tages over viral vectors, but they currently lack 
the transfer efficiency of viral vectors. 

Gene-Blocking Therapies 

Gene-replacement techniques are not effective in correcting 
gain-of-function or dominant negative mutations (e.g., Hun¬ 
tington disease, Marfan syndrome). To correct these condi¬ 
tions, the defective gene product must be blocked or disabled 
in some way. Although not as well developed as gene- 
replacement therapy methods, a number of gene-blocking 
methods are showing considerable therapeutic promise. 

Antisense Therapy 

The principle behind antisense therapy is simple: an 
oligonucleotide is engineered whose DNA sequence is 
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complementary to that of the messenger RNA (mRNA) 
sequence produced by a gain-of-function mutation. This 
antisense oligonucleotide binds to the abnormal mRNA, pre¬ 
venting its translation into a harmful protein (Fig. 13-9A). 
Antisense oligonucleotides can also be engineered to bind to 
the double-stranded DNA containing the disease-causing 
mutation, creating a triple helix that cannot be transcribed 
into mRNA. A difficulty with antisense therapy is that anti- 
sense oligonucleotides are often degraded before they can 


reach their target. Also, because of variation in the shape of 
the target DNA or RNA molecule, the antisense oligonucle¬ 
otide might not be able to bind to its complementary 
sequence. Nevertheless, antisense therapy is being tested in a 
number of experimental applications, including blocking 
KRAS oncogene (see Chapter 9) expression in pancreatic and 
colorectal tumor cells. 

Antisense oligonucleotides can also be used to induce alter¬ 
native splice sites in pre-mRNA (mRNA from which the 
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FIG 13-9 A, Gene therapy using an antisense technique. Binding of the abnormal mRNA by the 
antisense molecule prevents it from being translated Into an abnormal protein. B, Gene therapy 
using a hammerhead ribozyme, which binds to a mutated mRNA, cleaving and eliminating it. 
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introns have not yet been removed; see Chapter 2) to remove 
an exon that is affected by a frameshift mutation. This exon 
skipping approach has shown considerable promise, for 
example, in treating patients with Duchenne muscular dystro¬ 
phy, where a large proportion of cases are caused by frameshift 
insertions or deletions (see Chapter 5). The result is a DMD 
mRNA transcript in which one exon is missing but the remain¬ 
der of the transcript has an intact reading frame (the pheno¬ 
type is expected to be similar to that of the milder Becker 
muscular dystrophy). In phase III clinical trials, treated 
patients show substantial increases in dystrophin production. 

Ribozyme Therapy 

Ribozymes are enzymatic RNA molecules, some of which 
can cleave mRNA. They can be engineered to disrupt specific 
mRNA sequences that contain a mutation, destroying them 
before they can be translated into protein (see Fig. 13-9B). 
Ribozyme therapy is being tested, for example, as a method 
of countering the overexpression of epidermal growth factor 
receptor type 2, a feature of many breast tumors. 

RNA Interference 

A third method of gene blocking involves RNA interference 
(RNAi; Fig. 13-10), a natural phenomenon that evolved to 
defend cells against viral invasion. Because many viruses 
produce double-stranded RNA, cells of all multicellular 
organisms recognize this form of RNA and use an enzyme 
called dicer to digest it into small 20-bp pieces. These pieces 
are then used as a template to direct the destruction of any 
single-stranded RNA that has the same sequence as the 
double-stranded viral RNA (e.g., the single-stranded 
mRNA that the virus would use to encode viral proteins). 
By artificially synthesizing double-stranded RNA molecules 


that correspond to a disease-causing DNA sequence, RNAi 
can be induced to destroy the mRNA produced by the 
mutated sequence. 

RNAi faces challenges similar to those of antisense and 
ribozyme therapy, such as degradation of the RNA molecule 
before it reaches the target. This difficulty is being overcome 
by inserting RNAi molecules into lentiviral and adeno- 
associated viral vectors. RNAi has shown some promise in 
reducing, for example, the number of transcripts produced 
by oncogenic KRAS, and it has also been shown to block 
transcripts of the BCR-ABL fusion gene, which causes chronic 
myelogenous leukemia (see Chapter 1 1). It is being tested for 
the treatment of age-related macular degeneration, asthma, 
hepatitis C, and Huntington disease. 

Gene-blocking techniques may be used to 
counter the effects of dominant-negative or 
gain-of-function mutations. They include the 
use of antisense molecules, RNA-cleaving ribo¬ 
zymes, and RNA interference. 

Gene Therapy for Noninherited Diseases 

Gene therapy is by no means limited to inherited diseases. 
Indeed, about two thirds of the gene therapy protocols now 
under way involve noninherited cancers, and a significant 
proportion involve acquired immunodeficiency syndrome 
(AIDS) therapy. For example, the TP53 tumor suppressor 
gene, which is inactivated in about half of all cancers (see 
Chapter 11), has been inserted into lung tumors in an effort 
to halt tumor progression. As discussed in Chapter 9, some 
tumors evade immune system detection by discarding cell- 
surface molecules that are recognized by T cells. Liposomes 




FIG 13-10 Gene-blocking therapy using RNA interference (RNAi). A dicer cleaves double-stranded 
RNA (dsRNA) into 20-bp single-stranded RNA fragments called short interfering RNAs (siRNAs). 
These fragments form a template that is recognized and bound by the RNA-induced silencing 
complex (RISC), which cleaves and destroys the complementary RNA strand. In RNAi, a dsRNA 
is engineered to produce siRNA strands that are complementary to a mutated mRNA, causing 
the RISC complex to destroy the mRNA. 
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containing DNA that encodes the B7 costimulatory molecule 
(see Chapter 9) have been introduced into malignant mela¬ 
noma cells, which results in B7 expression on the cell surface 
and subsequent cell destruction by cytotoxic T cells. In some 
cases, this has led to regression of the melanoma. 

A variety of gene therapy approaches are being formulated 
to combat HIV. Most of these efforts are aimed at halting 
replication of the virus or preventing its spread to healthy 
ceUs. For example, a dominant-negative mutation introduced 
into HIV-infected T cells produces a protein that interferes 
with proteins produced by HIV, blocking their normal action. 
Trials are also in progress to reduce expression of CCR5, a 
chemokine coreceptor used by HIV to enter cells of the 
immune system (see Chapter 9). 

Another example of gene therapy for a noninherited 
disease involves the treatment of coronary artery disease. 
Copies of the genes that encode members of the vascular 
endothelial growth factor (VEGF) and fibroblast growth 
factor (FGF) families have been injected into ischemic myo¬ 
cardium (using viral vectors or as naked DNA) with the hope 
of producing new coronary vessels. 

Germline Therapy 

Somatic cell therapy consists of altering only specific somatic 
cells and thus differs little in principle from many other types 
of medical intervention (e.g., bone marrow transplantation). 
In contrast, germline gene therapy involves altering all cells 
of the body, including those that give rise to the gametes. 
Thus, this type of gene therapy would affect not only the 
patient but also his or her descendants. 

Germline therapy was first achieved in the mouse in 1983, 
when copies of a human growth hormone gene were success¬ 
fully introduced into mouse embryos by microinjection (the 
gene was inserted directly into the embryo using a very small 
needle). Among the minority of embryos in which the gene 
integrated, the gene was expressed and the mice grew to 
abnormally large size. The gametes were also modified, and 
the human growth hormone gene was transmitted to future 
generations. 


Although germline therapy is, in principle, possible in 
humans, it presents significant problems (Box 13-7). First, 
injected embryos usually die, and some develop tumors and 
malformations. Second, even in an autosomal dominant dis¬ 
order, half of the embryos produced by a heterozygous parent 
are genetically normal. If it were possible to distinguish the 
genetically normal embryos (e.g., through preimplantation 
genetic diagnosis), it would be simpler to implant the normal 
embryos than to alter the abnormal ones. Finally, numerous 
ethical questions are associated with the permanent altera¬ 
tion of a human’s genetic legacy. For these reasons, it appears 
unlikely that human germline therapy would be useful or 
desirable. 

Gene Therapy: A Perspective 

The majority of gene therapy protocols are still in phase I and 
phase II trials, but more than 100 are now in phase III clinical 
trials. (The NIH web site, https://clinicaltrials.gov/, is an 
excellent resource for reviewing the current and ongoing 
trials.) The past several years have witnessed the first arguable 
successes of gene therapy, some of which have been discussed 
in this chapter (successful therapy for X-linked SCID and 
ADA deficiency, hemophilia B, and other conditions listed in 
Table 13-8; evidence of therapeutic effects in various cancers). 
However, success has been achieved thus far in only a rela¬ 
tively small number of persons. 

Gene therapy is not without risk. In addition to the 
insertional mutagenesis potential already discussed, a young 
man with ornithine transcarbamylase deficiency (see 
Chapter 7) died in 1999 as a result of an adverse immune 
reaction to an adenovirus vector. In addition, retroviral 
therapy has occasionally resulted in insertional mutagenesis 
(see Clinical Commentary 13-5). Despite these reservations, 
gene therapy research is providing many new insights of 
fundamental biological significance. As with many avenues 
of biomedical research, the potential of gene therapy 
research is considerable, and current progress suggests 
strongly that it may provide efficacious treatment of some 
important human diseases. 


BOX 1 3-7 Germline Therapy, Genetic Enhancement, Human Cloning, and Embryonic 
Stem Cells: Controversial Issues in Medical Genetics 


For reasons outlined in the text, germline gene therapy is not 
being undertaken in humans. Nevertheless, germline gene 
therapy is in many ways technically easier to perform than is 
somatic cell therapy. Germline therapy also offers (in theory) 
the possibility of "genetic enhancement," the introduction of 
favorable genes into the embryo. However, a gene that is 
favorable in one environment may be unfavorable in another 
(e.g., the sickle cell mutation, which is advantageous only for 
heterozygotes in a malarial environment). And, because of plei- 
otropy, the introduction of advantageous genes can have com¬ 
pletely unintended consequences (e.g., a gene thought to 
enhance one characteristic could negatively affect another). For 
these reasons, and because germline therapy usually destroys 


the targeted embryo, neither germline therapy nor genetic 
enhancement is advocated by the scientific community. 

Controversy also surrounds the prospect of cloning humans. 
Many mammalian species (e.g., sheep, pigs, cattle, goats, 
mice, cats, dogs) have been successfully cloned by introducing 
a diploid nucleus from an adult cell into an egg cell from which 
the original haploid nucleus was removed (a technique termed 
somatic cell nuclear transfer, or SCNT; see the figure that 
follows). The cell is manipulated so that all of its genes can be 
expressed (recall that most genes in a typical differentiated 
adult cell are transcriptionally silent). This procedure, when 
allowed to proceed through a full-term pregnancy, could likely 
be used to produce a human being (reproductive cloning). 
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BOX 13-7 Germline Therapy, Genetic Enhancement, Human Cloning, and Embryonic 
Stem Cells: Controversial Issues in Medical Genetics—cont'd 


Some argue that human cloning offers childless couples the 
opportunity to produce children to whom they are biologically 
related or even to replace a child who has died. It is important 
to keep in mind, however, that a clone is only a genetic copy. 
The environment of the individual, which also plays a large role 
in development, cannot be replicated. Furthermore, the great 
majority of cloning attempts in mammals fail: in most cases, 
the embryo either dies or has gross malformations. Because 
the consequences of human reproductive cloning would almost 
certainly be similar, reproductive cloning of humans is con¬ 
demned almost universally by scientists. 

It is important to distinguish reproductive cloning from the 
cloning and cultivation of cells for therapeutic purposes. 
Embryonic stem cells (ESCs), which are derived from the 
inner-cell mass of blastocyst-stage embryos, can be cloned 
and have the unique potential to differentiate into any type of 
cell in the human body (pluripotency). For example, they can 
potentially form neurons for the treatment of Parkinson 
disease or cardiac myocytes for the treatment of ischemic 
heart disease. However, with current technology, the embryo 
is destroyed to obtain ESCs, and this is controversial in many 
circles. Ongoing research efforts are aimed at inducing pluri¬ 
potency in differentiated adult cells. Research is also under 
way to extract usable single cells from 3-day blastomere 
embryos (as in preimplantation genetic diagnosis) so that 
embryos are not destroyed. It remains to be seen whether 
these technologies can produce cells that have the same flex¬ 
ibility and utility as ESCs. 

One difficulty in using cells derived from ESCs is that they 
might induce an immune response in the recipient. This 
problem could be largely overcome if ESC clones were avail¬ 
able from many persons with different MHC types. The recipi¬ 
ent would then be immunologically matched to the appropriate 
ESC. However, only a limited number of ESC lines are currently 
available to most researchers. Another suggestion is that SCNT 
could be used with a patient's own cells to create ESCs that 
would be identical in DMA sequence to the patient. 

Although these technologies offer the hope of effective treat¬ 
ment for some recalcitrant diseases, they also present thorny 
ethical issues. Clearly, decisions regarding their use must be 
guided by constructive input from scientists, legal scholars, 
philosophers, and others. 


Mouse somatic cells 
cultured in medium that 
induces nonspecialization 
(pluripotency) 



Nucleus is removed 
from mouse egg cell 
(enucleation) 



enucleated egg cell 



Somatic cell nuclear transfer to create a mouse clone. A 


mouse diploid somatic cell (e.g., a fibroblast) is cultured and 
grown in media that cause it to become pluripotent. It is 
fused with an enucleated egg cell, creating a diploid one-cell 
embryo. This embryo is allowed to develop to a multicell 
stage and implanted in a mouse's uterus. The resulting 
mouse is genetically identical (a clone) to the mouse that 
provided the somatic cell. 


I STUDY QUESTIONS 


1. A newborn-screening program for a metabolic disease has 
just been initiated. Of 100,000 newborns, 100 were shown 
by a definitive test to have been affected with the disease. 
The screening test identified 93 of these neonates as 
affected and 7 as unaffected, ft also identified 1000 neo¬ 
nates as affected who were later shown to be unaffected. 
Calculate the sensitivity, specificity, and positive predictive 
value of the screening test, and specify the rate of false 
positives and false negatives. 

2. Study the family shown in the pedigree in Figure 13-11. 
Individual 3 has PKU, an autosomal recessive disease. A 


two-allele SNP closely linked to the PKU locus has been 
assayed for each family member, and the figure shows the 
genotypes of each individual. Based on the genotypes of 
the linked marker, is individual 6 affected, a heterozygous 
carrier, or a normal homozygote? 

3. Study the family shown in the pedigree in Figure 13-12. The 
affected individuals have neurofibromatosis type 1 (NFl), 
an autosomal dominant condition. A four-allele STR locus 
closely linked to the NFl locus has been typed for each 
family member. Based on the genotypes shown in the 
accompanying figure, will individual 6 develop NFl? 
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FIG 13-11 Pedigree to accompany Study Question 2. 



FIG 13-12 Pedigree to accompany Study Question 3. 


4. In the pedigree for an autosomal dominant disorder 
shown in Figure 13-13, a tightly linked two-allele SNP has 
been typed in each family member. Based on this infor¬ 
mation, what can you tell the family about the risk that 
the offspring in generation III will develop the disorder? 
How might diagnostic accuracy be improved in this 
case? 

5. Compare the advantages and disadvantages of amniocen¬ 
tesis and chorionic villus sampling (CVS). 

6. What type of gene therapy would be most appropriate for 
Huntington disease? Why? 



FIG 13-13 Pedigree to accompany Study Question 4. 
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CHAPTER 


Genetics and Precision Medicine 


Scientific, technological, and medical advances have made it 
possible to detect, diagnose, and treat most common diseases 
(e.g., asthma, diabetes, hypertension) early in their course 
and more effectively than ever before. Such advances are, 
however, highly dependent on the skills and knowledge of 
clinicians, access to health-care services, and the availability 
and affordability of diagnostic technologies. Most health-care 
providers follow a conventional model in which a patient 
presents with a set of symptoms and signs, which the provider 
uses to make a “most likely” diagnosis. The provider then 
prescribes a treatment that he or she thinks will be most 
effective. If this treatment fails, the process is repeated until 
a correct diagnosis or more effective treatment is found. In 
such a model, preventive health maintenance is encouraged. 
Compliance, however, is challenging because information 
about risk factors, as well as the patient’s perception of risk, 
is at best approximate. 

Precision or predictive medicine is a model of practice in 
which each person’s personal risk for rare and common con¬ 
ditions and the effectiveness of various treatments are esti¬ 
mated directly from his or her unique combination of genetic 
and environmental risk factors. Accordingly, a health-care 
provider can predict a person’s risk for common diseases, 
select diagnostic tests to confirm the presence of disease, and 
prescribe the best therapeutic regimen to treat it. Ideally, 
knowledge of disease risk promotes interventions (e.g., modi¬ 
fication of diet, choice of drug therapy) that not only can treat 
disease early in its course but also can delay its onset or 
prevent it altogether. 

The effectiveness of precision medicine depends on a 
number of factors. These include identifying genetic and 
environmental risk factors (and their interactions) that 
enable accurate prediction of clinically significant risk; dem¬ 
onstrating that individual risk assessment improves diagnos¬ 
tic accuracy and treatment outcome; developing technologies 
for cost-efficient assessment of a person’s genome; building 
an infrastructure for clinicians to access risk data, interpret 
risk information, and explain risk estimates to patients; and 
developing guidelines and policies for how risk assessment 
information should be used in clinical and research applica¬ 
tions. Not all of these aims will be achieved for every common 
disease. Indeed, for many complex diseases, it is likely that 
there will be no alternative to the conventional model of 
practice in the near future because so little is known about 
their etiology and pathophysiology. Nevertheless, for some 
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common diseases and drug responses, genetic testing and, in 
several cases precision medicine, are already being adapted to 
the clinical setting. 

In this chapter, we discuss how new technologies are 
making the assessment of individual human genomes widely 
accessible, how such genome-wide information is being used 
currently to make personal decisions about health, and the 
implications of precision care. 

Precision medicine is the use of each person's 
unique combination of genetic and environ¬ 
mental risk factors to make predictions 
about his or her disease risk and response to 
various treatments. 


A TECHNOLOGY-DRIVEN TRANSFORMATION 

Traditionally, the search for genetic variants that influence 
risk for mendelian conditions and common complex dis¬ 
eases has been a daunting task and has been one of the 
major obstacles to developing precision medicine. The most 
common approach to finding such variants involved testing 
whether polymorphisms in candidate genes were associated 
with disease risk in a small group of unrelated patients with 
the same phenotype (e.g., diabetes, obesity). This was prob¬ 
lematic, in part because choosing the most appropriate can¬ 
didate genes was difficult, small cohorts provided limited 
statistical power, and the process of genotyping or sequenc¬ 
ing was labor intensive and expensive. This situation has 
changed dramatically with the development of technologies 
to interrogate millions of polymorphisms per person 
cheaply and efficiently (Box 14-1) and more recently the 
introduction of exome and genome sequencing (see 
Chapter 8). These technologies, coupled with advances in 
statistics and computing, enabled the application of new 
approaches such as genome-wide association studies (see 
Chapter 8), as well as the study of much larger cohorts of 
tens or hundreds of thousands of persons. In addition, 
these new technologies for genotyping and DNA sequenc¬ 
ing have made it possible to develop cost-effective clinical 
tests that take advantage of newly discovered risk variants. 
The major challenge nowadays to implementing precision 
medicine is translating discoveries of risk variants into clin¬ 
ical practice. 
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BOX 14-1 Assessing Your Genome 


Knowledge of a person's genetic makeup will clearly be an 
important tool for making better decisions about health, 
medical care, and perhaps lifestyle as well. Until recently, 
assessing the genome as a whole was fairly expensive and 
done only in research laboratories. However, new technolo¬ 
gies have dramatically lowered the cost of whole-genome 
analysis and have spurred the development of consumer 
services that offer whole-genome studies directly to the 
public (see Chapter 13). These services have rapidly made 
headlines, as much for their novelty as for their potential to 
inform persons about their genetic composition. 

Most whole-genome consumer services offer to genotype 
millions of common single nucleotide polymorphisms 
(SNPs). The SNPs typed for consumers are the same ones 
commonly being used by researchers to identify disease- 
gene associations for common multifactorial disorders such 
as hypertension, diabetes, and obesity. As gene-disease 
associations are reported, consumers who have access to 
their genetic information can evaluate their risk for genetic 
diseases. Also, because each person's genetic data are per¬ 
manent, evaluation of risk can be reassessed with each new 
discovery. However, many of the SNP-disease associations 
reported to consumers are relatively weak and may be mis¬ 
understood or misinterpreted by the lay consumer (see 
Chapter 13). 

More recently, exome sequencing and whole-genome 
sequencing have been made available to the public. This 
service is still expensive and therefore very limited in applica¬ 
tion. It remains unclear whether a person's understanding of 
health-related risks will be increased by knowing about some 
or all of the genetic risk variants that they carry. Most of the 
caveats raised in the previous paragraph for whole-genome 
SNP typing apply equally to whole-genome sequencing. 
However, since risk variants for mendelian conditions are 
frequently detected by sequencing, some variants of poten¬ 
tial clinical utility are virtually always found. 


THE IMPACT OF GENOMICS _ 

Pharmacogenetics 

Many of the drinks (e.g., coffee, tea) and foods that we ingest 
each day contain thousands of complex compounds that each 
of us must process. Some of these compounds never leave the 
gastrointestinal tract, but most are absorbed, distributed, 
metabolized, and eliminated (i.e., biotransformed) to a 
variety of products that are used immediately, stored, or 
excreted. Exogenously synthesized compounds that are 
administered to achieve a specific effect on the human body 
(e.g., pharmaceuticals) also undergo biotransformation, and 
humans vary in the efficiency and speed with which they do 
this. Moreover, the response of a drug’s target (e.g., enzymes, 
receptors) can also vary among individuals. The study of the 
individual genetic variants that modify human responses to 
pharmacological agents is called pharmacogenetics; the 
assessment of the action of many genes simultaneously is 
called pharmacogenomics. 


Genetic Prediction of Serious Adverse 
Drug Responses 

Over the last decade, ambitious efforts have been undertaken 
to advance the knowledge of pharmacogenetics. This has 
been driven, in part, by the expectation that through the use 
of pharmacogenetics, we will be able to profile DNA differ¬ 
ences among individuals and thereby predict responses 
to different medicines. For example, a genetic profile (i.e., a 
summary of a person’s risk alleles) might predict who is more 
or less likely to respond to a drug or to suffer a serious 
adverse drug reaction (SADR). 

Many drugs have a response rate between 25% and 75%. For 
example, ACE inhibitors and beta blockers have been found to 
be ineffective or only partially effective in up to 70% of hyper¬ 
tensive patients. The use of such drugs in persons who are 
unlikely to respond increases the incidence of SADRs and adds 
to the burden of health-care costs. Yet, for most drugs, no tests 
are available to determine who will or will not respond, so these 
drugs are administered largely on a trial-and-error basis. 

Many drugs have adverse effects that are of clinical impor¬ 
tance, and of the approximately 1200 drugs approved for use 
in the United States, about 15% are associated with a signifi¬ 
cant incidence of SADRs. A widely cited analysis conducted 
in the mid-1990s suggested that nearly 2 million people are 
hospitalized each year as a result of adverse drug effects, and 
approximately 100,000 people die from them, even when the 
drugs are appropriately prescribed and administered. Studies 
in Europe and Australia have yielded similar results. Thus, 
identification of genetic profiles that predict a person’s 
response to drugs is likely to increase the overall efficacy and 
safety of pharmaceuticals. 

Testing is currently available for a handful of alleles that 
predict SADRs. For example, thiopurine methyl transferase 
(TPMT) is an enzyme that inactivates thiopurine drugs (e.g., 
6-mercaptopurine, azathioprine), which are frequently used 
to treat acute lymphatic leukemia and to prevent rejection of 
organ transplants. A mutation of the TPMT gene reduces 
enzyme activity. About one in 300 persons of European 
ancestry is homozygous for this mutation, and these patients 
can experience life-threatening bone marrow suppression 
upon exposure to thiopurine drugs. The presence of such 
variants can be assessed by genotyping or by enzyme assays, 
which are now commonly done before administering thiopu- 
rines. Large-scale exome sequencing studies have demon¬ 
strated that each person carries at least several alleles that 
alter drug metabolism compared to the general population. 

Each person's response to natural and syn¬ 
thetic chemicals is determined in part by poly¬ 
morphisms in genes that control pathways of 
biotransformation and the chemical's target. 

Individualized Drug Therapy 

One of the major challenges of pharmacogenetics is the selec¬ 
tion of appropriate targets (e.g., a specific enzyme, cytokine. 
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or cell-surface receptor) that might be amenable to manipu¬ 
lation by a drug. The results of genetic studies are used to 
identify polymorphisms associated with varying susceptibil¬ 
ity to disease (i.e., a potential target for a drug) or polymor¬ 
phisms that modify the human response to a drug. For 
example, long QT syndrome (LQT syndrome; see Chapter 12) 
can be caused by mutations in one of at least a dozen different 
genes whose protein products affect ion channel function in 
heart cells (e.g., sodium and calcium channels). Because 
sodium channels and calcium channels are blocked by 
different drugs, a person’s genetic profile can be used to 
choose the best drug for treatment of LQT syndrome. In 
this case, the relationship between disease and target is well 
characterized. 

Polymorphisms in genes that encode angiotensinogen, 
angiotensin-converting enzyme (ACE), and the angiotensin 
II type I receptor have been associated with differential 
responses to antihypertensive agents. For example, the ACE 
gene contains a 190-bp sequence that can be either present 
(the I allele) or deleted (the D allele). Persons who are homo¬ 
zygous for the D allele are more responsive to ACE inhibitors. 
Response to antihypertensive beta blockers has been associ¬ 
ated with polymorphisms in genes that encode subunits of 
the (J-adrenergic receptor (Table 14-1). None of these vari¬ 
ants are commonly tested prior to initiating antihypertensive 
therapy, but studies are under way to determine when such 
information, in conjunction with environmental risk factors 
such as smoking and diet, might facilitate the development 
of individualized treatment. 


Many of the physiological effects of variation in drug 
response have been known for decades. A deficiency of 
glucose-6-phosphate dehydrogenase (G6PD), which is esti¬ 
mated to affect more than 200 million people worldwide, 
causes increased sensitivity to the antimalarial drug, prima¬ 
quine, producing an acute hemolytic anemia. The metabo¬ 
lism of isoniazid (a drug commonly used to treat tuberculosis) 
is strongly influenced by an allele of the gene that encodes 
N-acetyltransacetylase 2 (NAT2), the enzyme that is used to 
acetylate, and thereby inactivate, isoniazid. Persons who are 
homozygous for this allele are known as slow inactivators and 
are at higher risk for developing side effects than persons who 
metabolize isoniazid more quickly. About half of persons of 
European or African ancestry are slow inactivators, but this 
figure is lower among East Asians. Succinylcholine is a drug 
widely used in anesthesia to induce short-term muscle paral¬ 
ysis. Typically, the effects of succinylcholine last only a few 
minutes before it is rapidly degraded in the plasma by circu¬ 
lating butyrylcholinesterase. Several alleles of the gene that 
encodes butyrylcholinesterase cause reduced enzyme activity. 
Persons who are homozygotes or compound heterozygotes 
for such alleles have a diminished ability to inactivate succi¬ 
nylcholine. This can result in prolonged paralysis and respira¬ 
tory failure that requires mechanical ventilation for up to 
several hours. 

In each of these examples, a person who has a relatively 
common allele might, upon exposure to a specific chemical, 
experience an unanticipated pharmacological effect. Variants 
have been discovered in enzymes that produce a much 


TABLE 

14-1 Examples of Effects of Gene Polymorphisms on Drug Response 

GENE 

ENZYME/TARGET 

DRUG 

CLINICAL RESPONSE 

CYP2D6 

Cytochrome P4502D6 

Codeine 

Persons homozygous for an inactivating mutation 
do not metabolize codeine to morphine and thus 
experience no analgesic effect 

CYP2C9 

Cytochrome P4502C9 

Warfarin 

Persons heterozygous for a polymorphism need a 
lower dose of warfarin to maintain anticoagulation 

VKORC1 

Vitamin K epoxide reductase 

Warfarin 

Persons heterozygous for a polymorphism need a 
lower dose of warfarin complex, subunit 1, to 
maintain anticoagulation 

NAT2 

A/-Acetyl transferase 2 

Isoniazid 

Persons homozygous for slow-acetylation 
polymorphisms are more susceptible to isoniazid 
toxicity 

TPMT 

Thiopurine S-methyltransferase 

Azathioprine 

Persons homozygous for an inactivating mutation 
develop severe toxicity if treated with standard 
doses of azathioprine 

ADRB2 

P-Adrenergic receptor 

Albuterol 

Persons homozygous for a polymorphism get 
worse with regular use of albuterol 

KCNE2 

Potassium channel, voltage-gated 

Clarithromycin 

Persons heterozygous for a polymorphism are 
more susceptible to life-threatening arrhythmias 

SURl 

Sulfonylurea receptor 1 

Sulfonylureas 

Persons heterozygous for polymorphisms exhibit 
diminished sensitivity to sulfonylurea-stimulated 
insulin secretion 

F5 

Coagulation factor V (Leiden) 

Oral contraceptives 

Persons heterozygous for a polymorphism are at 
increased risk for venous thrombosis 
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broader effect on the body’s response to multiple drugs. An 
example is debrisoquine hydroxylase, an enzyme encoded by 
the gene CYP2D6. This gene is a member of the cytochrome 
P450 superfamily, which encodes many different enzymes 
responsible for the biotransformation of compounds with 
widely divergent chemical structures. Polymorphisms of 
CYP2D6 affect the metabolism of more than 25% of all phar¬ 
maceuticals, including (J-adrenergic receptor antagonists, 
neuroleptics, and tricyclic antidepressants (Fig. 14-1). 
Another example is variants in CYP2C19 that diminish the 
rate at which the pro-drug clopidogrel, an antiplatelet drug, 
is metabolized to its active form. Clopidogrel is commonly 
used in the more than 2 million persons who undergo stent¬ 
ing of their coronary arteries, and persons with a variant in 
CYP2C19 that reduces function are at substantially higher 
risk of stent thrombosis—which can precipitate a heart attack 
and death. All of these are examples of relatively simply 


genetic profiles (i.e., single polymorphisms) that affect drug 
response. Many drug responses are likely to be determined by 
much more complex profiles that are composed of multiple 
polymorphisms at multiple loci. 

Two common variants of CYP2C9 (CYP2C9*2 and 
CYP2C9*3), another cytochrome P450 gene, influence the 
metabolism of warfarin, an anticoagulant drug. The frequen¬ 
cies of these alleles vary between 6% and 12% in populations 
of European origin, but each is found at a substantially 
lower frequency in sub-Saharan Africans and East Asians. 
Warfarin is widely used to prevent thrombosis, but because 
of variation in dose requirements, hemorrhagic complica¬ 
tions from warfarin therapy are common. Therefore, a per¬ 
son’s level of anticoagulation needs to be checked regularly 
so that warfarin is given at a dose that prevents thrombosis 
but avoids excessive bleeding. Persons with at least one copy 
of either CYP2C9*2 or CYP2C9*3 require less warfarin for 
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FIG 14-1 Genotype-phenotype relationships between CYP2D6 polymorphisms and drug metabo¬ 
lism. A, Possible genotypes at the CYP2D6 locus. Fully functional alleles of the CYP2D6 gene 
are indicated by dark purple boxes, alleles with reduced function in pink, and null (i.e., inactive) 
CYP2D6 alleles in light purple. B, The ability to metabolize many drugs varies depending on an 
individual's C/PPObgenotype. C, Distribution of phenotype frequencies assessed in a population 
of European Americans as determined by the urinary metabolic ratio (MR) of debrisoquine to 
4-hydroxy-debrisoquine. D, Poor metabolizers require a smaller dose of the antidepressant drug 
nortriptyline, and ultrarapid metabolizers require a higher dose to achieve the same plasma 
concentration. (Adapted from Meyers U. Pharmacogenetics—five decades of therapeutic lessons 
from genetic diversity. Nat Rev Genet. 2004:5:669-676.) 
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effective anticoagulation than the general population. Con¬ 
sistent with this observation, hemorrhagic complications are, 
on standard dosing, more common in persons who carry the 
CYP2C9*2 or the CYP2C9*3 alleles. Thus, CYP2C9 variants 
influence both warfarin metabolism and adverse outcomes 
associated with warfarin. Genetic variation in one of warfa¬ 
rin’s pharmacologic targets, vitamin K expoxide reductase 
(VKORCl; see Table 14-1), also helps to predict a person’s 
response to this drug. Genetic testing can be done on both 
CYP2C9 and VKORCl to help to calibrate warfarin dosage. 

Pharmacogenetics and pharmacogenomics are slowly 
beginning to change the way that medicine is practiced, 
although the pace of change is likely to accelerate over the 
next few decades (Box 14-2). A primary issue for all alleles 
that are associated with drug response is whether testing these 
alleles will affect the clinical management of patients and, if 
so, to what extent. The genetic profile of a drug response may 
be important if the drug is widely used in clinical practice 
and the response to the drug is medically important, if the 
drug’s therapeutic and toxic effects are difficult to assess and 
titrate clinically, if adverse effects are difficult to predict with 
existing information, and if a profile provides easily interpre¬ 
table results with high sensitivity and specificity. It is unclear 
how many drug-and-genetic profile combinations are likely 
to meet these criteria, although to date more than a hundred 
drugs now carry a label indicating that genetic testing should 
be performed prior to use. 


Genetic testing for polymorphisms associated 
with variation in drug metabolism or effective¬ 
ness can lead to better predictions of a per¬ 
son's response to drugs and can reduce the 
incidence of drug-related side effects. 

Diagnosing and Monitoring Common Disease 

In the previous sections, we have explained how genomic 
information can be used to personalize risk assessments for 
rare and common diseases as well as drug responses. Genomic 
information can also be used to facilitate disease diagnosis 
and to monitor therapeutic responses. For example, a micro¬ 
array or RNA sequencing (see Chapter 3) can be used to 
estimate the expression level of each gene (i.e., the amount 
of mRNA that is transcribed) in a specific tissue. These gene- 
expression profiles can be used to identify patterns of 
gene expression that are associated with specific conditions 
(e.g., increased transcription of an oncogene or reduced tran¬ 
scription of a tumor suppressor gene in tumor tissue). 
Such information can help to distinguish different types of 
cancers, different types of infections, or other phenotypes 
associated with disease. It should be appreciated that genomic 
information is just one of many types of data about a person 
and their environment that can be assessed to make predic¬ 
tions about disease states. This includes information about 
protein expression (i.e., proteome), metabolic functions 



BOX 14-2 Personal Genomics 


It is 2025. Jonathan is a 1 hour-old baby sleeping comfortably 
In his mother's arms In the room in which he was just born. 
Jonathan's parents are comforted by the knowledge that Jona¬ 
than does not have a serious childhood-onset genetic condition 
such as cystic fibrosis, sickle cell disease or one of several 
hundred inborn errors of metabolism (e.g., galactosemia, PKU). 
Testing for these conditions used to be part of newborn screen¬ 
ing programs, but nowadays testing for these conditions 
occurs prenataliy. In most cases, this screening is done via 
whole genome sequencing of the fetal genome from fetal free- 
DNA found in a pregnant mother's blood. Jonathan's parents 
did learn that he is a carrier for sickle cell disease. An electronic 
summary of Jonathan's complete genome sequence is depos¬ 
ited in a national health information database prior to his birth. 
A subset of genotypes that represent a unique genetic profile 
are put Into a national forensics database. 

Jonathan's parents can access his genome sequence infor¬ 
mation, specifically Information about risk variants that he has, 
at any time. They also control who has access to these risk 
data, and they elect to provide Jonathan's pediatric care pro¬ 
viders with risk variants for conditions that commonly mani¬ 
fest in childhood. On Jonathan's well-child visit at 1 month of 
age, a genetic counselor explains that Jonathan has a higher- 
than-average genetic risk for autism, peanut allergy, chronic 
otitis media, and adverse responses to penicillin. It Is recom¬ 
mended to his parents that he avoid both penicillin and prod¬ 
ucts containing peanuts until he can undergo direct testing. 


Jonathan is also found to have a lower-than-average genetic 
risk for asthma. 

At 1 year of age, it is apparent that Jonathan's speech and 
language development Is delayed. His genetic profile confirms 
that he has no risk variants known to be associated with 
hearing loss, suggesting that his delay might be an early Indica¬ 
tion of autism. An optimal therapy for autism is selected based 
on his genetic profile. Jonathan responds well to this Interven¬ 
tion and. In conjunction with speech training, his development 
by the age of 5 years is appropriate. 

Jonathan remains healthy throughout childhood, and, when he 
turns 18 years old, control over his genetic risk data is switched 
from his parents to him. At the same time, Jonathan's medical 
care is transferred to a family physician. At his first appointment, 
Jonathan's physician explains his risk for heart disease, hyper¬ 
tension, obesity, type 2 diabetes, and colon cancer. Jonathan is 
alerted to his high risk of developing both diabetes and obesity, 
and a program of exercise and diet that has been shown to delay 
disease onset is recommended to him. 

Ten years later, Jonathan informs his physician that he and 
his wife are planning to start a family. His wife is also a carrier 
for sickle cell disease and has several risk variants for asthma, 
so they are referred for further counseling about prenatal 
genetic testing options. When Jonathan is 45, he develops 
hypertension, and, based on his profile of drug response vari¬ 
ants, therapy is initiated with a specific antihypertensive agent 
to which he is likely to respond. 
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(i.e., metabolome), normal body flora (i.e., microbiome), and 
environmental exposures (i.e., exposome). 

Cancer Genomics 

Every cancer cell harbors numerous alterations in DNA 
sequence and copy number that affect genes or regulatory 
sequences, often accompanied by reversible, epigenetic, mod¬ 
ifications. These changes perturb the expression and/or func¬ 
tion of hundreds to thousands of genes. Collectively, these 
changes result in the activation or inhibition of various cel¬ 
lular pathways that control the characteristics of cancers such 
as growth or metastasis, and they determine, in part, progno¬ 
sis and response to treatment. Cancer genomics is the study 
of the DNA-associated changes that accompany cancer with 
the overall goal of better preventing, detecting, diagnosing, 
and treating common cancers. 

A particularly powerful application of genomics to cancer 
has been the use of genome-wide gene expression analyses to 
provide a snapshot of gene activity within a tumor at a given 
point in time. This has facilitated the development of classi¬ 
fication schemes based on expression profiles for many types 
of cancer, including leukemia, lymphoma, and cancers of the 
breast, lung, colon, and brain. This information can be used, 
for example, in refining prognosis, directing the application 
of conventional and targeted biological therapies, and iden¬ 
tifying targets for new drug development (Fig. 14-2). 

Currently, it is often difficult to predict the prognosis of 
cancer patients based on traditional phenotypic information 
such as the type of tumor (T), whether the cancer is found 
in nearby lymph nodes (N), and evidence of metastasis (M). 
Staging using this TNM system is currently the standard for 
most solid tumors, yet these stages are often not predictive of 
prognosis or treatment response. Gene-expression profiling 
can help to distinguish between cancers that are easily con¬ 
fused (e.g., Burkitt lymphoma vs. diffuse large B-cell lym¬ 
phoma). ft can also facilitate the identification of subsets of 
tumors of the same TNM stage that might have quite differ¬ 
ent outcomes. Several gene-expression profiles are currently 
available for assessing breast cancer prognosis, and gene- 
expression profiles that predict recurrence of several other 
types of cancer have been established. Prospective trials will 
determine the extent to which the use of expression profiling 
is of clinical benefit, but it is anticipated that its use will lead 
to a substantial improvement in cancer management. 

The conventional approach to cancer therapy has been to 
provide treatment based on the tissue or organ in which the 
cancer originated. However, persons with the same type of 
cancer often have different genetic abnormalities in their 
tumors, resulting in differential responses to treatment. For 
example, among young women whose breast cancer has not 
spread to their lymph nodes and who are treated by resection 
of the tumor and local radiation, only 20% to 30% will expe¬ 
rience a recurrence. This subgroup of women might benefit 
the most from receiving adjuvant chemotherapy, and those 
at lower risk of recurrence (the majority) might benefit less 
from chemotherapy. Yet, because the high- and low-risk 
groups cannot be distinguished reliably, 85% to 95% of all 



Prognosis reporter 
genes 

FIG 14-2 Prediction of disease outcome by gene-expression 
profiling. The clinical outcome of individuals with lung cancer 
(circled tumor on radiograph) is predicted by testing the 
expression of a set of genes known to be abnormally regu¬ 
lated in lung cancer cells. For each individual tumor, RNA is 
extracted and placed onto a microarray, and the expression 
of each gene is measured. Bottom, Each column represents 
the expression profile of a different tumor. Diminished 
expression of a gene in one lung tumor compared to other 
lung tumors is indicated by green, and increased expression 
is indicated by red. The outcome of the disease is shown at 
the right, where white indicates persons with metastatic 
disease (poor outcome) and black indicates no metastasis 
(good outcome). 
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women with this type of breast cancer receive adjuvant che¬ 
motherapy. This means that many women might undergo 
such treatment unnecessarily, putting them at risk for drug- 
related complications and increasing the overall cost of health 
care. Expression profiling has the potential to help delineate 
subsets of cancers that are likely to be more responsive to 
various therapeutic regimens and to guide the optimal selec¬ 
tion of agents for each individual. 

Gene-expression profiling of cancers is helping 
to improve the classification of different types 
of tumors and may help to guide therapy. 

Common Disease 

Gene-expression profiling is being used to study the patho¬ 
genesis of common diseases and to monitor tissue-specific 
gene activity in order to facilitate diagnosis and monitor 
disease progression and treatment. For example, expression 
profiling of circulating white blood cells in patients with type 
1 diabetes has revealed increased expression of a large number 
of proinflammatory genes. The expression of some of these 
genes is also increased in persons with rheumatoid arthritis, 
suggesting that some autoimmune disorders might share 
expression profiles. A screening test based on these profiles 
might enable earlier diagnosis and/or identify high-risk 
persons who could benefit from preventive care. Gene profil¬ 
ing has also been used effectively to monitor immunosuppres¬ 
sion in individuals who have undergone organ transplantation. 
Studies are also under way to identify whether gene-expression 
profiles can predict outcome in persons infected with patho¬ 
gens such as malaria, HIV-1, and tuberculosis. 

Race and Genetic Assessment of 
Individual Ancestry 

An important and controversial issue in precision medicine 
is whether a person’s race—using its historical meaning as a 
descriptor of Africans, Asians, Europeans, Native Americans, 
and Pacific Islanders—and/or genetic ancestry is useful for 
making predictions about health-related risks. Traditionally, 
it has been commonplace to use race to predict the likelihood 
that a person carries a particular genetic variant that influ¬ 
ences susceptibility to disease or drug response. This practice 
was based partly on the observation that disparities in health 
are common among racial groups. For example, the incidence 
of prostate cancer is twofold higher in African American men 
than in European American men. Other disorders that vary 
in prevalence or outcome among racial groups include hyper¬ 
tension, end-stage renal disease, preterm birth, and type 2 
diabetes. It remains unclear, however, whether genetic risk 
factors explain, even partly, these disparities. Many health- 
related disparities probably are influenced more strongly 
instead by environmental factors such as dietary differences 
and inequities in the provision of health-care services. 
Accordingly, the use of race to make predictions about 
whether a person has such risk factors is still the subject of 
considerable debate. 


It is important to distinguish between race and genetic 
ancestry. Race has traditionally been used to categorize large 
groups of persons and can reflect geographic origin, lan¬ 
guage, and various cultural attributes that describe a group 
(e.g.. Native Americans or Asians). Ancestry refers to the geo¬ 
graphic, historical, or biological origins of one’s ancestors 
and, for any person, can be complex. For example, a person 
might have ancestors from Africa, Europe, and North America 
(i.e., a complex ancestry), but he or she might still self-identify 
as an African American. Therefore, race captures some bio¬ 
logical information about ancestry, but the two concepts are 
not equivalent. Knowledge of a person’s ancestry can provide 
information about his or her genetic makeup and thus can 
be useful for identifying genetic and environmental factors 
that underlie common diseases. Accordingly, over the past 
several years, it has become increasingly common to use 
genetic markers such as single nucleotide polymorphisms 
(SNPs) to directly estimate the genetic ancestry of a person 
(Fig. 14-3). The extent to which race helps us to predict 
genetic differences that influence health depends partly on 
how well traditional classifications of race correspond with 
such genetic inferences of individual ancestry. 

On average, persons chosen at random from different 
populations, such as sub-Saharan Africans, Europeans, and 
East Asians, will be only slightly more different from one 
another than persons from the same population, reflecting 
the fact that all humans are quite similar in their DNA 
sequence (see Chapter 3). Common disease-associated poly¬ 
morphisms, such as those associated with response to 
antihypertensive agents (see earlier), often differ in these 
populations only in their frequency. Few, if any, genetic vari¬ 
ants are found in aU members of one major population and 
in no members of another major population. For this reason, 
population affiliation or race is not a reliable predictor of 
individual genotypes. 

It is possible, however, to allocate individuals to groups 
that correspond to different geographic regions by simultane¬ 
ously analyzing several hundred or more variants, such as 
SNPs (see Fig. 14-3). These variants differ in frequency among 
geographic regions because our ancestors were more likely to 
mate with close neighbors than with distant ones. Thus, 
proxies for geographic ancestry such as race can sometimes 
be used to make reasonably accurate predictions of a person’s 
genetic ancestry. Indeed, several studies done in the United 
States have reported that classification of persons by self- 
identified population group is highly correlated with infer¬ 
ences based on genetic data. 

In many circumstances, however, race is not a good predic¬ 
tor of ancestry. For example, populations from neighboring 
geographical regions typically share more recent common 
ancestors, and therefore their allele frequencies can be very 
similar. Gonsequently, persons sampled at regular intervals 
across some intercontinental regions (e.g., the Middle East or 
Central Asia) are more difficult to allocate into genetic groups 
that are concordant with common notions of race. Corre¬ 
spondence with geography is also less apparent for popula¬ 
tions (e.g., Latin Americans, South Asians) that have been 
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Asian ancestry 



Self-identified group 
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I Asian American ^ 
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FIG 14-3 Genetically inferred ancestry fractions for persons 
(colored circles) sampled from the United States and geno- 
typed for 6000 single nucleotide polymorphisms (SNPs). 
Each circle represents one person, color-coded to correspond 
to one of four self-identified groups. The distance of a circle 
to the edge of the triangle is proportional to the amount of 
the person's ancestry contributed by each of the three ances¬ 
tral populations in the corners of the triangle (African, Asian, 
and European). For example, the Elispanic/Latino American 
labeled number 4 received about 60% of his genetic ancestry 
from Europe, 30% from Asia (due to Native American ances¬ 
try), and 10% from Africa. The circles representing Elispanic/ 
Latino and African Americans are less tightly clustered 
because the proportion of ancestry among persons is more 
varied than in Asian Americans and Americans of European 
descent. A bar graph indicates the estimated ancestry propor¬ 
tions for each of the subjects labeled 1-5. 


influenced by recent historical mixtures of multiple ancestral 
populations. 

In the United States, race is only a crude predictor of a 
person’s genetic ancestry. For example, the average portion of 
African ancestry among self-identified African Americans is 


about 80%, but it ranges from 100% to 20% or even less in 
some persons. The genetic composition of self-identified 
European Americans also varies, with about 30% of Euro¬ 
pean Americans estimated to have less than 90% European 
ancestry. Similarly, Hispanics from different regions of the 
United States have highly variable ancestries (e.g., more 
African ancestry in Hispanics living in the Southeast and 
more Native American ancestry in the Southwest). Accord¬ 
ingly, memhership in a group does not mean that all members 
of the group necessarily have similar genetic ancestries. 

Although it is clear that explicit genetic information, 
rather than race, can be used to make more accurate infer¬ 
ences of ancestry, it is not yet known to what extent personal 
ancestry information can make useful predictions about one’s 
risk of common disease. The consequences of using detailed 
ancestry information in a clinical setting are also largely 
unknown. It is possible that personal ancestry information 
could have adverse effects on a person’s perception of risk and 
cultural identity. Similarly, such information could reinforce 
unfair stereotypes about specific populations. Further 
research is needed to examine the potential benefits and risks 
of using ancestry information in clinical practice. 



The relationship between ancestry and tradi¬ 
tional concepts of race is complex. Genetic 
information, rather than race, is a better pre¬ 
dictor of ancestry. 


THE FUTURE OF PREDICTIVE MEDICINE 

Genetic variants that influence the risk of common disease 
or response to treatment are now being found with increas¬ 
ing speed and efficiency (see Chapter 12), but only a small 
fraction of the genetic basis of disease risk and treatment 
response has thus far been defined. In addition, the interac¬ 
tions of multiple disease-predisposing gene products, and 
their interactions with nongenetic factors, remain almost 
completely unknown. Thus, the promise of predictive medi¬ 
cine, in which a detailed genetic profile can provide clinically 
useful risk information for common diseases such as diabe¬ 
tes, cancer, or heart disease, remains largely unfulfilled. It is 
hoped that with increasing knowledge of alleles that predis¬ 
pose persons to disease, genetic testing will begin to contrib¬ 
ute more substantially to the diagnosis and treatment of 
disease. It must also be borne in mind that nongenetic 
factors, such as diet and exercise, are also part of each per¬ 
son’s risk profile. These factors can and should be assessed 
and modified to maximize each person’s potential for a long 
and healthy life. 
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I STUDY QUESTIONS 


1. Explain how genetic information can be used to enhance 
the practice of predictive medicine compared to a conven¬ 
tional model of medical service. Give at least one example. 

2. Different individuals with the same type of cancer often 
respond differently to therapy. Provide at least two possi¬ 
ble explanations of this observation. 

3. Define race and ancestry; explain the differences between 
them. 

4. Consider how explicit genetic information about your 
ancestry might change your perception of your biological 
and cultural identities. 


5. Give an example of a polymorphism that affects drug 
metabolism and/or the response to a drug. 

6. Explain some of the possible obstacles to the use of genetic 
information in practicing precision medicine. 

7. Distinguish between genetic medicine and genomic 
medicine. 

8. Give examples of how the availability of whole-genome 
data from individuals might change the ways medicine is 
currently practiced. 
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Clinical Genetics and Genetic Counseling 


In the last 2 decades medical genetics has clearly emerged as 
a recognized specialty in mainstream medicine. In the 1960s, 
the fields of biochemical genetics, clinical cytogenetics, and 
dysmorphology (the study of abnormal physical develop¬ 
ment) arose as disciplines primarily within pediatrics. The 
1970s witnessed the establishment of the techniques neces¬ 
sary for the prenatal diagnosis of genetic disorders. By the end 
of the 1970s, discussions about forming the American Board 
of Medical Genetics had occurred, and in 1981 the first cer¬ 
tification examination was administered. The American 
Board of Genetic Counseling was established in the early 
1990s, and now various types of geneticists, including genetic 
counselors, medical geneticists, and basic human geneticists, 
can be certified as specialists within the discipline. In 1991 
the American Board of Medical Specialties recognized this 
new field, and medical genetics has now become an integral 
part of medicine. 

Whereas medical genetics involves the study of the genet¬ 
ics of human disease, clinical genetics deals with the direct 
clinical care of persons with genetic diseases. The diagnostic, 
counseling, and management issues surrounding genetic 
disease are the principal foci of clinical genetics. 

In this chapter, we summarize the principles of clinical 
genetics and the process of genetic counseling. In addition, 
we provide an overview of the field of dysmorphology, 
because the growth of this area has influenced and paralleled 
the emergence of clinical genetics. 

THE PRINCIPLES AND PRACTICE OF 
CLINICAL GENETICS 

As mentioned in Chapter 1, genetic conditions as a group are 
common and are a significant cause of human mortality and 
morbidity. Typically, genetic disorders are complex, multior¬ 
gan, systemic conditions, and the care of persons with these 
disorders can also involve multiple medical specialties. Thus, 
genetic disorders are among the differential diagnosis of 
most symptoms and clinical presentations. For example, 
when evaluating an infant with a blistering skin disease, the 
ability to distinguish between one of the many forms of epi¬ 
dermolysis bullosa (an inherited disorder of keratinocytes in 
which skin blisters develop after mild trauma) and staphylo¬ 
coccal skin disease must be part of the clinician’s repertoire. 

Because of the complexity and number of human genetic 
diseases, their clinical diagnosis and treatment can seem over¬ 
whelming. To help manage this information, we provide an 


overview of the most important concepts, including the 
importance of accurate diagnosis, the application of the 
tenets of medical genetics to medical practice, and the role of 
genetic counseling in the care of persons with genetic disease. 

Accurate Diagnosis 

The significance of the basic medical principle of accurate 
diagnosis cannot be overemphasized. The process of genetic 
counseling, one of the principal components of the service of 
medical genetics, begins with correct diagnosis. All discus¬ 
sions of natural history, prognosis, management, risk deter¬ 
mination, options for prenatal diagnosis, and referral to 
genetic support groups (also termed genetic advocacy groups) 
depend on an accurate diagnosis of the patient’s condition. 
For example, genetic counseling for a family who has a son 
with an intellectual disability usually involves questions of 
risk for this condition in future offspring. An accurate answer 
requires the clinician to identify a condition of known etiol¬ 
ogy. Once a specific diagnosis (e.g., fragile X syndrome) is 
made, then the genetic counseling process begins: Current 
information can be shared and management can be initiated 
(Clinical Commentary 15-1). 

In clinical genetics, as in all of medicine, accu¬ 
rate diagnosis is the most important first step 
in patient care. 

The process of diagnosing a genetic disorder is a complex 
sequence of events. It depends upon diagnostic decision 
making, recognition of important phenotypic signs, applica¬ 
tion of principles of dysmorphology and medical genetics, 
and laboratory diagnosis. For diseases in which the diagnostic 
criteria are well established, the practitioner has guidelines 
for making a diagnosis. An example of such criteria is the 
criteria recommended by the National Institutes of Health 
Consensus Development Conference for the diagnosis of 
neurofibromatosis type 1 (NFl; see Chapter 4). For condi¬ 
tions that are defined by a specific laboratory marker, such as 
an abnormal karyotype or biochemical assay, the diagnostic 
procedure is generally straightforward. For many genetic dis¬ 
eases, however, there are no well-established criteria, the defi¬ 
nition and delineation of the disorder are not clear-cut, and 
diagnosis can be challenging. 

Dysmorphic syndromes require knowledge and skills in 
the recognition of mild malformations, minor anomalies, 
and phenotypic variations. The diagnosis of other genetic 
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CLINICAL COMMENTARY 15-1 
Reasons for Making a Diagnosis of a Syndrome 

The long list of syndromes associated with congenital malfor¬ 
mations can be overwhelming to the clinician. More than 
400 conditions are listed in Smith's Recognizable Patterns of 
Human Malformations, and more than 1000 are accessible 
through the POSSUM or London Dysmorphology computer¬ 
ized databases (see Internet Resources at the end of this 
chapter). This number imparts a sense that the diagnosis of a 
malformation syndrome lies in the arena of academic trivia. 
However, this is not the case. 

Consider, for example, the child who is large for gestational 
age and has a number of physical abnormalities: omphalocele 
(intestinal protrusion at the umbilicus), large tongue, facial hem¬ 
angioma, flank mass, and asymmetrical limb length (Fig. 15-1). 
His family has questions such as, "What does he have?," 
"How will he do?," "Will he look different?," "Will he have 



FIG 15-1 A child with Beckwith-Wiedemann syndrome. 
Note the prominent eyes and large, protruding tongue. 


developmental delays?," "What is the chance of his condition 
occurring again in another child?" 

By putting these features together and making the pattern 
recognition diagnosis of the Beckwith-Wiedemann syndrome, 
the clinician is able to answer all of the parents' questions fairly 
precisely. Most cases of Beckwith-Wiedemann syndrome 
occur sporadically, but some are inherited. In addition, the 
genes that are involved in causing the condition exhibit imprint¬ 
ing effects (see Chapter 5). If the child has the typical methyla- 
tion alteration that causes the majority of cases, the sibling 
recurrence risk is low and likely less than 1 %. If the results of 
the methylation analysis are normal, the recurrence risk is 
higher, and linkage or mutation analysis can provide a more 
precise risk estimate. In future pregnancies, prenatal diagnosis 
using ultrasound can test for an omphalocele in the second 
trimester and for large size for gestational age, excessive 
amniotic fluid (polyhydramnios), and large tongue. If a fetus is 
thought to have Beckwith-Wiedemann syndrome, then the 
delivery plan would change and the baby should be born in a 
tertiary care center. 

Children with the Beckwith-Wiedemann syndrome do not 
usually have cognitive disabilities. Although the large tongue 
can cause orthodontic problems, speech difficulties, and occa¬ 
sionally upper airway problems, these conditions usually 
improve as the child gets older. The facial appearance is not 
strikingly abnormal in later childhood. 

Chromosome analysis should be considered, although most 
Beckwith-Wiedemann patients do not have the chromosome 
11 duplication that has been reported in a small number of 
cases. Otherwise, the main emphases of the medical care plan 
include regular abdominal sonogram until age 7 years to look 
for intra-abdominal malignancies, especially Wilms tumor and 
hepatoblastoma. Children with Beckwith-Wiedemann syn¬ 
drome have a 5% to 10% risk of developing these tumors, and 
both types are treatable if detected early. 

In this example, it was important to diagnose the Beckwith- 
Wiedemann syndrome. The correct label led to precise infor¬ 
mation for genetic counseling, prediction of natural history 
(including reassurance), organization of appropriate laboratory 
studies, a health supervision plan, and referral to a lay advocacy 
group. Diagnosis was helpful to the parents, the family physi¬ 
cian, and the child. 


diseases, including cancer syndromes and inborn errors of 
metabolism, can require expertise from a variety of disci¬ 
plines. For instance, the diagnosis of any of the forms of reti¬ 
nitis pigmentosa (see Chapter 8) necessitates input from an 
ophthalmologist who is familiar with this group of retinal 
degenerative conditions. The diagnostic process is further 
complicated by the variable expression, incomplete pene¬ 
trance, and heterogeneity of many genetic diseases. These 
concepts are discussed in Chapter 4. 

Application of the Principles of Medical Genetics 

Developing a genetic approach to human disease in the clini¬ 
cal setting requires the application of all of the basic 


principles of medical genetics discussed in this book. This is 
often called “thinking genetically.” For example, making or 
excluding the diagnosis of NFl requires knowledge of the 
clinical variability and age of onset of certain features of 
the condition (Clinical Commentary 15-2). Recognition of 
the various forms of neurofibromatosis (i.e., heterogeneity) 
is also important in developing a management plan. 

Knowledge of the other formal principles of medical 
genetics is also necessary in the care of persons with genetic 
conditions. The accumulation of family history data and 
the interpretation of pedigree information are important in 
answering a family’s questions regarding risk of recurrence. 
An understanding of the various modes of inheritance is 
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CLINICAL COMMENTARY 15-2 
The Negative Family History 

One of the common discussions on ward rounds is the notation 
that a person's family history is negative or "noncontributory." 
This is often thought to rule out a genetic disorder. However, 
the majority of persons who have a genetic disease do not 
have a positive family history. A quick review of the mecha¬ 
nisms of mendelian, chromosomal, and multifactorial disease 
inheritance shows that a lack of other affected persons in the 
family is common and does not by any means exclude the 
presence of a genetic disease. For example, the sibling recur¬ 
rence risk is 25% for diseases with autosomal recessive 
inheritance. Thus, a significant number of families with multiple 
offspring have only one affected child and no family history. 
Even some well-established autosomal dominant disorders 
often present a negative family history because of high propor¬ 
tions of new mutations (examples include Marfan syndrome, 
neurofibromatosis type 1 [NFI], and achondroplasia, in which 
the percentages of cases caused by new mutations are 30%, 
50%, and 80%, respectively). Chromosomal syndromes usually 
have a low recurrence risk. Even when a parent carries a bal¬ 
anced chromosome rearrangement, the recurrence risk among 
the offspring is usually less than 15%. The sibling recurrence 
risks for multifactorial conditions are usually 5% or 
less. 

Case 

A family comes in with a 6-year-old boy who has 10 cafe-au-lait 
spots exceeding 0.5 cm in diameter widely distributed on the 
trunk and arms and an optic glioma detected on a brain MRI 
(Fig. 15-2). The family has questions about the diagnosis and 
the recurrence risk in future pregnancies. On initial telephone 
contact it is learned that there is no history of a family member 
with similar features. 

There are several possible explanations for this finding. 
Exploring them underscores the implications of a negative 
family history: 

• New mutation of the NFI gene. Because of the relatively 
high percentage of new mutations for this disorder, this is 
the most likely explanation. 

• Variable expression. It is also possible that one of the parents 
carries the gene but has mild expression of the phenotype. 
Occasionally a parent has multiple cafe-au-lait spots and 
a few unrecognized neurofibromas, but a diagnosis of NFI 
has never been made. Thus, it is important to evaluate the 
parents for mild expression of NFI. 

• Incomplete penetrance. This is a possibility; however, it is 
unlikely for NFI, in which penetrance is close to 100%. If a 
family has two children with NFI and neither parent has 



FIG 15-2 A 6-year-old boy with multiple cafe-au-lait spots. 
(From Burger P, Scheithauer B, Vogel FS: Surgical Pathol¬ 
ogy of the Nervous System and Its Coverings, 4th ed. Phila¬ 
delphia: Churchill Livingstone; 2002.) 


the gene, germline mosaicism would be the more likely 
explanation. 

• Incorrect diagnosis. One of the assumptions and basic prin¬ 
ciples of medical genetics is accurate diagnosis. This patient 
meets the National Institutes of Health established criteria 
for NFI (see Chapter 4). However, if this patient had only 
cafe-au-lait spots, then the diagnosis would have been an 
issue. One would need to know the differential diagnosis of 
multiple cafe-au-lait spots. 

• False paternity. Although it is relatively unlikely, this possibil¬ 
ity must be kept in mind. 

We began with an individual who had a classic autosomal 
dominant disorder with no family history. This can be 
explained in a number of ways. The statement that there is 
"a negative family history" should not be considered conclu¬ 
sive evidence against the presence of a heritable condition. 


required in any explanation of recurrence risk. Discussion 
of the concepts of new mutation and pleiotropy are com¬ 
monplace in reviewing the cause and pathogenesis of a 
genetic disease with a family. Even an understanding of 
meiosis is a requirement for discussions of etiology with 
the family of a newborn with Down syndrome (Clinical 
Commentary 15-3). 


Genetic Counseling: Definition and Principles 

Genetic counseling represents one of the central foci of 
medical genetics and with accurate diagnosis comprises the 
cornerstones of the field. At first glance, use of the term 
“counseling” implies that this service lies in the domain of 
mental health, social work, or psychotherapy. In fact, genetic 
counseling is centered in the conventional medical model 
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CLINICAL COMMENTARY 15-3 

Talking to the Parents of a Newborn with Down Syndrome 

The birth of a newborn with Down syndrome presents many 
challenges. Typically, the infant is not acutely ill and the parents 
are not aware of the diagnosis before the birth. Thus, the 
practitioner must approach the parents, often strangers, with 
unexpected and sometimes disappointing news. The family 
can experience a series of emotions that are somewhat similar 
to the reactions after a loss: anger, denial, sadness, and then 
usually reorganization and adaptation. Families face these situ¬ 
ations with markedly different backgrounds: varying attitudes 
toward crisis, different demographic and socioeconomic cir¬ 
cumstances, and even a wide range of differences in the cul¬ 
tural meaning of a disability or defect. All of these variables, 
plus the fact that physicians are often not trained in being the 
bearers of difficult news, can make this a challenging situation. 
Parents remember in detail the way the news is presented. 
The practitioner has both the opportunity and the challenge to 
help the family through these events. 

A number of practical suggestions have come from studies 
investigating the recommendations of parents who have expe¬ 
rienced this event: 

• Prepare yourself. Set up the interview scenario, and think 
about how you will begin the discussion. 

• Talk to both parents together whenever possible. This is 
sometimes not practical, but when it can be accomplished, 
it is critical. 

• Communicate the diagnosis as soon as possible. All studies 
of parental interviews show that they prefer early communi¬ 
cation of the diagnosis. 

• Choose a place that Is private and quiet where both the 
parents and the professionals can sit down. Avoid standing 
up with the parents seated. Always be sure to introduce 
yourself. Structure the interview from the beginning. 

• Humanize the situation as much as possible. Learn the 
baby's first name if it has been decided on, and always 
know the baby's gender. Refer to the infant by name or as 
a son or daughter, and be aware of the use of all language. 
Phrases such as "mental retardation" have great impact, and 


currently that term is not considered the appropriate desig¬ 
nation by professionals in pediatrics, psychology, and medical 
genetics (it has been replaced by intellectual disability or 
cognitive disability). Terms such as "mongolism" are not 
appropriate because they are stigmatizing, pejorative, and 
incorrect. 

• Develop a sense of realistic positivism. It is important to 
discuss the developmental limitations in a patient with Down 
syndrome, but it is also important to have an optimistic and 
positive attitude. This suggestion comes from the advocacy 
and parents' organizations that have developed in the last 
four decades. 

• Answer the parents' questions, but avoid technical overload. 
It is important to be accurate and current on the biological 
and medical aspects of the condition under discussion. 
When an answer is not known, mention that the question 
can be reviewed or referred to a consultant. 

• Listen actively. Assume that almost all feelings are natural 
and that parents will be wrestling with their own guilt and 
shame. Validate all feelings that arise. Most parents can 
meet this challenge quite effectively and do not require 
psychiatric consultation. 

• Refer the family to the appropriate resources early. This 
would include parent advocacy groups or even individual 
parents who have a child with Down syndrome. Share avail¬ 
able written material and web pages, but make certain that 
it is accurate and current. 

Above all, be aware of the unique plight of families in such a 
situation, and make an effort to spend time with them. Although 
it is difficult to present in written form how one can develop 
attributes such as kindness and empathy, it is important for 
physicians in training to learn from their mentors and use 
their own individual communication style as a strength. Clearly, 
the recommendations provided here apply not only to genetic 
counseling but also to any situation in which difficult informa¬ 
tion is presented to patients or families. (See Suggested 
Readings at the end of the chapter.) 


because it depends significantly on accurate diagnosis and 
knowledge of medical genetics. As a tradition, genetic coun¬ 
seling grew out of the field of human genetics rather than 
from behavioral science, unlike other counseling disciplines. 

In 1975 the American Society of Human Genetics adopted 
a definition of genetic counseling. Newer language has been 
proposed recently to modernize and clarify this definition, 
but the original language stands the test of time: 

Genetic counseling is a communication process that deals 
with the human problems associated with the occurrence 
or risk of occurrence of a genetic disorder in a family. This 
process involves an attempt by one or more appropriately 
trained persons to help the individual or family to (1) 
comprehend the medical facts, including the diagnosis, 
probable course of the disorder, and the available man¬ 
agement; (2) appreciate the way heredity contributes 
to the disorder and the risk of recurrence in specified 
relatives; (3) understand the alternatives for dealing with 


the risk of recurrence; (4) choose a course of action 
that seems to them appropriate in their view of their risk, 
their family goals, and their ethical and religious stan¬ 
dards, and act in accordance with that decision; and (5) 
make the best possible adjustment to the disorder in an 
affected family member and/or to the risk of recurrence 
of that disorder. 

This definition illustrates the complex tasks that face the 
practitioner. The first task involves establishing the diagnosis 
and discussing the natural history and management of the 
condition. In this regard, the medical care of a patient with a 
genetic disease does not differ from that of a patient with any 
other type of human disease. 

The second task requires an understanding of the basic 
tenets of medical genetics, especially the principles of human 
genetics and risk determination discussed throughout the 
text. For chromosomal and multifactorial disorders, empiri¬ 
cal figures are used to estimate recurrence risk. Knowledge of 
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inheritance patterns is applied to predict the recurrence risk 
of mendelian disorders. However, the clinical issues are often 
complicated by incomplete penetrance, variable expression, 
delayed age of onset, and allelic and locus heterogeneity. In 
some cases, incorporation of additional information using 


the Bayesian probability approach can significantly assist in 
providing estimates (Box 15-1). 

The third and fourth objectives of the genetic counseling 
process underlie the primary differences between the genetic 
model and the traditional biomedical approach. These tasks 


BOX 15-1 Recurrence Risks and Bayes Theorem 


The estimation of recurrence risks was treated at some length 
in Chapters 4 and 5. A typical example of recurrence risk estima¬ 
tion is a case in which a man with hemophilia A, an X-linked 
recessive disorder, produces a daughter (individual 11-1 in the 
following diagram). Because the man can transmit only the X 
chromosome carrying the hemophilia A mutation to his daugh¬ 
ter, she must be a carrier. The carrier's daughter, individual III-6, 
has a 50% chance of receiving the X chromosome carrying the 
mutation and being herself a carrier. Even though the daughter 
in generation III has five normal brothers, her risk remains 50% 
because we know that the mother in generation II is a carrier. 



Suppose now that the woman in generation III produces 
three sons (generation IV), none of whom has hemophilia A. 
Intuitively, we might begin to suspect that she is not a carrier 
after all. How can we incorporate this new information into our 
recurrence risk estimate? 

A statistical principle that allows us to make use of such 
information is called Bayes theorem (the application of Bayes 
theorem is often termed Bayesian analysis or Bayesian infer¬ 
ence). The table below summarizes the basic steps involved in 
Bayesian analysis. We begin with the prior probability that 
the woman in generation III is a carrier. As its name suggests, 
the prior probability denotes the probability that she is a carrier 
before we account for the fact that she has produced three 
normal sons. Because we know her mother is a carrier, this 
woman's prior probability must be 1/2. Then the prior probabil¬ 
ity that she is not a carrier is 1 - 1/2, or 1/2. 



SHE IS A 
CARRIER 

SHE IS NOT 
A CARRIER 

Prior probability 

1/2 

1/2 

Conditional probability 

1/8 

1 

Joint probability 

1/16 

1/2 

Posterior probability 

1/9 

8/9 


Next we take into account the woman's three normal sons 
by estimating the probability that ail three of them would be 
normal given that she is a carrier. Because this probability is 
conditioned on her carrier status, it is termed a conditional 
probability. If she is a carrier, the conditional probability that 
all three of her sons are normal would be (1/2)^, or 1/8. We also 
estimate the probability that all of her sons would be normal 
given that she is not a carrier. This conditional probability is, of 
course, very close to 1. 

Next we want to find the probability that the woman is a 
carrier and that she is a carrier with three normal sons. To 
obtain the probability of the co-occurrence of these two events, 
we multiply the prior probability times the conditional probabil¬ 
ity to derive a joint probability (i.e., the probability of both 
events occurring together, a concept discussed in Chapter 4). 
The joint probability that she is a carrier is then 1/2 x 1/8 = 1/16. 
Similarly, the joint probability that she is not a carrier is 1/2 x 
1 = 1/2. These joint probabilities indicate the woman is 8 times 
more likely not to be a carrier than to be a carrier. 

The final step is to standardize the joint probabilities so that 
the two probabilities under consideration (i.e., being a carrier 
versus not being a carrier) sum to 1. To do this, we simply 
divide the joint probability that the woman is a carrier (1/16) by 
the sum of the two joint probabilities (1/16 + til). This yields 
a posterior probability of 1/9 that she is a carrier and 8/9 that 
she is not a carrier. Notice that this standardization process 
allows us to provide a risk estimate (1/9, or 11%), while pre¬ 
serving the odds of noncarrier versus carrier status indicated 
by the joint probabilities. 

Having worked through the Bayesian analysis, we see that 
our intuition was confirmed: The fact that the woman in ques¬ 
tion produced three normal sons reduced her risk of being a 
carrier substantially, from an initial estimate of 50% to a final 
probability of only 11%. 

Another common application of Bayesian analysis is illus¬ 
trated in part A of the diagram that follows. The male in genera¬ 
tion II is affected with Duchenne muscular dystrophy (DMD), 
a lethal X-linked recessive disease (see Chapter 5). Either his 
unaffected mother is a carrier of the mutation, or he received 
a new mutation on the X chromosome transmitted by his 
mother. It is important to determine whether the mother is a 
carrier or not, because this fact will influence recurrence risks 
for DMD in her subsequent offspring. If the mother has only 
one affected offspring, the probability that she is a carrier can 
be evaluated directly, because one third of all cases of X-linked 
lethal recessive disorders arise as a result of new mutations. 
(To understand this, consider the fact that because females 
have two X chromosomes and males have only one, 2/3 of all 
X-linked disease-causing mutations in a population must be 
found in females. For a lethal X-linked recessive, all of the 
male X chromosomes are eliminated from the population in 
each generation. Yet the frequency of the mutation remains 


Continued 
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BOX 15-1 Recurrence Risks and Bayes Theorem—cont'd 


the same, generation after generation. This is because new 
disease-causing mutations arise at the same rate as the loss 
of mutation-containing X chromosomes. Because one third of 
the mutation-containing X chromosomes are lost each genera¬ 
tion, it follows that one third of the mutations in the population 
must occur as the result of new mutation.) If the probability 
that the affected son received a new mutation is 1/3, then 
the probability that the mother is a carrier—the alternative 
possibility—must be 1 - 1/3, or 2/3. 

In the table below, we use Bayesian analysis to evaluate the 
probability that the mother is a carrier. As in the previous 
example, we derive a prior probability that she is a carrier, 
assuming no knowledge that she has produced an affected 
son. This probability is given by 4|t, where p is the mutation 
rate for the DMD locus (i.e., the probability, per generation, that 
a disease-causing mutation arises at this locus in an individual). 
The derivation of the probability, 4p, is beyond the scope of 
this text, but it can be found elsewhere (see Suggested Read¬ 
ings, Hodge, 1998). Because the prior probability that the 
mother is a carrier is 4p, the prior probability that she is not a 
carrier is 1 - 4p, which is approximately equal to 1 because p 
is very small. The conditional probability that the woman trans¬ 
mits the mutation given that she is a carrier is 1/2 (there is also 
a very small probability that she transmits her normal allele, 
which is then mutated, but this can be ignored). The conditional 
probability that she transmits a mutation given that she is not 
a carrier (i.e., the probability that a new mutation arises in the 
gamete she transmits) is p. We then multiply the prior probabil¬ 
ity that she is a carrier, 4p, by the corresponding conditional 
probability, 1/2, to obtain a joint probability of 2p. The same 
procedure produces a joint probability of p that she is not a 
carrier. Finally, we standardize the joint probabilities to get the 
posterior probabilities. The posterior probability that she is a 
carrier is 2p -4- (2p + p) = 2/3, and the posterior probability that 
she is not a carrier is p (2p -i- p) = 1/3. As expected, these 
probabilities correspond to the ones we obtained by simple 
direct observation. 


Prior probability 
Conditional 
probability 
Joint probability 
Posterior probability 


SHE IS A SHE IS NOT A 
CARRIER CARRIER 

4p 1 - 4p « 1 

1/2 p 

2p p 

2/3 1/3 


Suppose, however, that the woman has had an affected son 
and an unaffected son (see part B of the figure below). This 
gives us additional information, and intuitively it increases the 
possibility that she is not a carrier (i.e., that the one affected 
offspring is the result of a new mutation). In the table below, 
we incorporate this new information. The prior probabilities 
remain the same as before (i.e., we assume no knowledge of 
either of her offspring). But the conditional probability of trans¬ 
mission, given that she is a carrier, changes to account for the 
fact that she now has two offspring: 1/2 x 1/2 = 1/4 (i.e., the 
probability that she did not transmit the mutation to one off¬ 
spring times the probability that she did transmit the mutation 


to the other offspring). The conditional probability that she 
transmitted a new mutation to the affected offspring is p, and 
the probability that she did not transmit a mutation to the unaf¬ 
fected offspring is 1 - p. Thus, the probability of both events, 
given that she is not a carrier, is p x (1 - p) ~ p. The joint 
and posterior probabilities are obtained as before, and we 
see that the woman's chance of being a carrier is now reduced 
from 2/3 to 1/2. Again, this confirms (and quantifies) our 
expectation. 



A B 


SHE IS A SHE IS NOT 

CARRIER A CARRIER 


Prior probability 
Conditional probability 
Joint probability 
Posterior probability 


4p 1 - 4p « 1 

1/4 px(1-p) = p 

p p 

1/2 1/2 


Before the advent of disease diagnosis through linked 
markers or mutation detection, Bayesian analysis was often 
the only way to derive a risk estimate in situations such as 
these. Now, of course, an attempt would be made to identify 
the factor VIII or DMD mutation that causes hemophilia A or 
DMD in these families directly, or, failing that, linked markers 
would be used. This is a much more direct and accurate 
approach for determining carrier status. However, as discussed 
in Chapter 13, it is not always possible to identify the respon¬ 
sible mutation, particularly when a large number of mutations 
can cause the disorder (as is the case for hemophilia A, DMD, 
or cystic fibrosis). Bayesian inference can be used in such 
cases to incorporate the sensitivity of the genetic test (e.g., if 
a standard mutation analysis of the CFTR gene reveals 85% of 
the mutations [see Chapter 13], there is a 15% probability that 
the person in question has the mutation even though the test 
did not reveal it). In addition, linkage analysis is not always 
informative. Thus, Bayesian analysis is still sometimes a useful 
tool for refining risk estimates. 

The additional information incorporated in Bayesian analysis 
is not confined to the assessment of health status in relatives, 
as was shown in these examples. Another type of information 
is a biochemical assay, such as factor VIII activity level, that 
could help to indicate carrier status. Because there is usually 
overlap between carriers and normal homozygotes for such 
tests, the assay cannot determine carrier status with certainty, 
but it does provide a probability estimate for incorporation into 
Bayesian analysis. In diseases with delayed age of onset, such 
as adult polycystic kidney disease, the probability of being 
affected at a certain age can be used in a Bayesian analysis. 
Here, one considers the fact that the at-risk person is less and 
less likely to possess the disease gene if he or she remains 
unaffected beyond a certain age. 
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involve discussing reproductive options and facilitating deci¬ 
sion making. Implicit in the fourth part of the definition is 
the notion of respect for the family’s autonomy and their 
perceptions of risk and of the disorder itself This approach 
has been called nondirectiveness: the counselor leaves all 
decisions about future reproduction up to the family. This 
differs somewhat from the more traditional medical approach, 
in which recommendations for treatment or intervention are 
often made in a more directed fashion. This is an important 
issue, because nondirectiveness sometimes conflicts with the 
broader view of preventive medicine, which might suggest 
that the principal goal of genetic counseling should be the 
reduction of the incidence of genetic diseases. 

Historically, the principle of nondirectiveness developed 
in the arena of reproductive counseling and in the context of 
decisions surrounding prenatal diagnosis. If prevention or 
reduction of disease is the primary goal, then one’s approach 
would logically be more directive. However, the main goal of 
genetic counseling is to help an individual family understand 
and cope with genetic disease, not to reduce the incidence of 
genetic disease. 

Although most geneticists subscribe to the principles of 
autonomy and nondirectiveness, it may be challenging to 
the clinician to be entirely nondirective, simply because of 
the limitations of a time-restricted session. Additionally it is 
challenging as a clinician to avoid reflecting one’s own 
biases. For example, an explanation of nutritional manage¬ 
ment of an infant with a disease detected by newborn 
screening (see Chapter 13) would require a more directive 
approach than the discussion of disease risks in future 
pregnancies. Nondirectiveness may be challenging when the 
consequences of disease are serious, as in high-risk cancer 
counseling (see the discussion of the newly proposed defini¬ 
tion later). In addition, information may be presented quite 
differently in different contexts. Information about Down 
syndrome, for example, may be communicated differently 
depending on whether the diagnosis was made prenatally or 
after the birth of an affected newborn (see Clinical Com¬ 
mentary 15-3) (although we would suggest that the infor¬ 
mation needs to be presented in a balanced manner in 
either situation). 

In 2006, leaders in the field of genetic counseling and 
the National Society of Genetic Counselors reached a con¬ 
sensus on an updated and revised definition of genetic 
counseling: 

Genetic counseling is the process of helping people under¬ 
stand and adapt to the medical, psychological, and 
familial implications of genetic contributions of disease. 
The process integrates the following: (1) interpretation of 
family and medical histories to assess the chance of disease 
occurrence or recurrence; (2) education about inheri¬ 
tance, testing, management, prevention, resources, and 
research; and (3) counseling to promote informed choices 
and adaptation to the risk of the condition. 

The revised definition emphasizes the dual and integrated 
roles of education and counseling. 


The majority of geneticists subscribe to the 
principle of nondirectiveness in reproductive 
counseling: Information about risks, natural 
history, treatment, and outcome is presented 
in a balanced and neutral manner, and 
decisions about reproduction are left to the 
family. 

The facilitation of discussion about reproductive deci¬ 
sion making is central to the task of genetic counseling. 
Several factors are involved in a family’s decision about 
future pregnancies when there is an increased risk. The 
obvious ones are the magnitude of the risk figure and the 
burden or impact of the disorder. However, these are not 
the only significant issues. The individual family’s percep¬ 
tion of the impact of the condition is probably more 
important in their decision making than the professional’s 
perception of the burden. The meaning of children to 
the individual family, according to their own cultural, reli¬ 
gious, or personal preferences, is weighed heavily in the 
reproductive decision-making process. In addition, families 
frequently play out the scenario of coping with a recur¬ 
rence of the condition in another child. Identification of 
these issues for a family often helps to stimulate their own 
discussions. Some families perceive a recurrence risk quali¬ 
tatively rather than quantitatively: They consider them¬ 
selves to be either “at risk” or not, with the actual risk 
estimate being a secondary consideration. The fact that 
there is so much variation in the importance people assign 
to each of these factors (perception of risk, perception of 
impact, meaning of children, and possibility of recurrence) 
underscores the point that the professional should be a 
facilitator and not the decision maker. Awareness of the 
family’s views and values is needed to help in this complex 
undertaking. 

A central task of genetic counseling is to help the family 
cope with the presence of the disorder or its recurrence risk, 
or both. This task is similar to the physician’s support of a 
family dealing with any chronic disease or disability. What is 
unique, perhaps, is the family’s perception of the meaning of 
a genetic disorder (Boxes 15-2 and 15-3). In many acquired 
conditions, such as infections or accidents, the ultimate 
meaning of the condition is externalized. In genetic disorders, 
the condition is more intrinsic to the individual and the 
family; it thus often presents a complex personal dilemma. 
Validation of the plight of families is vital and is probably 
more effective than simplistic attempts to remove guilt. Feel¬ 
ings of guilt and shame are natural to the situation and also 
need acknowledgment. 

The primary care practitioner plays a vital role in the 
ongoing support of families in which a member has a genetic 
disease. Additional support strategies include referral of the 
family to a genetic support and advocacy group, distribution 
of current printed and Internet information on the disorder, 
referral to mental health professionals for ongoing counsel¬ 
ing, and frequent follow-up visits that include time for dis¬ 
cussions of feelings and thoughts. 
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BOX 1 5-2 Birth of a Child with Trisomy 18 


Our daughter Juliett arrived on a beautiful summer afternoon 
in 1984. Late in my pregnancy, an ultrasound had showed an 
enlarged heart, dilated left kidney, and possible malformation 
of the cerebellum. During labor, our daughter's heart rate 
decelerated significantly, and we were given the option of an 
emergency C-section. Without hesitation, we opted for the 
C-section. A drape was hung so that I could not see, and I only 
knew that the baby was born when the pediatrician ran out of 
the room with something in his arms. My husband quickly fol¬ 
lowed, and then I waited for what seemed an eternity. 

Juliett weighed 4 pounds, 6 ounces and was 18 inches long. 

I had graduated as an RN just before Juliett was conceived and 
had worked in a pediatric ICU. I had just enough experience to 
pick up a few of her obvious problems, but many escaped me. 
She was clearly much too thin, and her rib cage looked too 
short and prominent. But compared to the mental images I had 
formed after the ultrasound, I was relieved to see how beauti¬ 
ful she was. Her most striking feature was her incredible blue 
eyes, which were wide open and very alert. Her nose and 
mouth were beautifully formed and very petite. As my husband 
and I sat in awe over her, a neonatologist entered the room. 
He pointed out several characteristics, the only one of which I 
clearly remember was the clenched fist with the index finger 
lying over the middle finger. He concluded that she probably 
had trisomy 18. Of the grim things he rattled off, the only thing 
I remembered was that he said she would be a vegetable and 
that she would most likely die within the next couple of days. 
He then walked away, and we sat there, stunned. In this state 
of grief and turmoil I tried to understand how this clenched fist 
could lead to death, and how these bright, alert eyes could 
belong to a vegetable. 

In the days that followed, I often opened up her fist and laid 
her fingers straight, hoping that the blood tests would not 
confirm the doctor's suspicions. Our bonding with Juliett had 


been instant, and our great desire was to be able to take her 
home before she died. As she began to tolerate feedings and 
was weaned away from oxygen, our pleas to take her home 
were granted. 

We left with no follow-up or care plan. Each time she went 
to sleep, we prayed she would wake up again and that we 
could complete another feeding. At 3 months of age, she 
started to smile at us, and our hopes brightened. We have been 
fortunate to see her outlive the grim statistics, and we have 
learned that there is no clear explanation why some children 
with this condition live longer than others. Juliett's heart was 
enlarged because of a defect similar to a tetralogy of Fallot. 
Mild scoliosis at birth has now progressed to a 100-degree 
lumbar curve and a 90-degree thoracic curve. Despite her many 
physical challenges, Juliett has continued at her slower pace 
to learn and develop new skills. Her personality is delightful, 
and people are often surprised to see how responsive and 
interactive she is. 

We have often been asked if we were afraid to have more 
children. Perhaps we were crazy, but we felt that another like 
Juliett would be great. We have had four more girls. To every¬ 
one's surprise, our fifth child, Camille, was born with Down 
syndrome. With Juliett, the grieving process had been covered 
up with the gratitude we felt that she was even alive. With 
Camille, we experienced the more typical grieving process. 

On the day of Juliett's birth, a pediatrician came forward, put 
his arms around us, and told us he thought she was beautiful 
and to love her for as long as she could be with us. He turned 
her Into a human being with a life to be highly valued. In the 
13 years that have followed, Juliett has seen many doctors. 
Most of them, although they could not cure her problems, gave 
us the most important thing we needed: to know that our 
daughter's life was of great worth and that, if they could, they 
would do anything to help her. 


Genetic counseling includes many themes: 
medical diagnosis and management, deter¬ 
mining risk of recurrence, options for address¬ 
ing the risk, reproductive decision making, and 
support services. 

Numerous studies in the past four decades have attempted 
to evaluate the effectiveness of genetic counseling. The meth¬ 
odology of these studies is complicated, and the evaluation 
of the results depends on one’s interpretation of the goal of 
genetic counseling. A few general points, however, can be 
made. Families tend to recall recurrence risks relatively well. 
A letter sent to them after the visit improves this recall. Fami¬ 
lies who perceive their offspring’s condition as being serious 
and one of impact recall risk figures better. Most studies 
suggest that genetic counseling is relatively effective in 
providing information about the medical aspects and genetic 
risks of the condition. Effectiveness regarding issues sur¬ 
rounding decision making and psychosocial support is 
more complex to define and assess and requires additional 
investigation. 


Genetic Counselors and the Delivery of 
Genetic Counseling 

As the discipline of medical genetics including genetic coun¬ 
seling evolved in the 1970s, it became clear that the delivery 
of this service is multifaceted and time-consuming. Not only 
did the geneticist need to have skills in most specialties of 
medicine, but facilitation of decision making and provision 
of psychological support were also necessary. As a need for 
genetics professionals other than physicians became appar¬ 
ent, a number of genetic counseling training programs 
emerged in the United States and Canada. Currently, 34 
accredited programs in North America provide master’s level 
training in genetic counseling. Genetic counselors have 
become integral partners with physicians and other 
professionals in the delivery of medical genetic services. 
From this growth evolved a professional society. The National 
Society of Genetic Gounselors, and the certifying body, 
the American Board of Genetic Counseling, and the accredit¬ 
ing body, the Accreditation Council of Genetic Counseling. 
Although the range of skills is wide and job descriptions 
vary in different medical centers, genetic counselors have 
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BOX 1 5-3 Raising a Child with Bloom Syndrome 


Tommy was bom via an emergency cesarean section, because 
1 week before his delivery date his fetal movements markedly 
decreased. At birth, he weighed only 4 pounds, and the first 
time that I saw him, he was in an incubator connected to all 
sorts of tubes. He spent his first month of life in the neonatal 
intensive care unit so that his weight gain could be closely 
monitored. Because he was so small, he was fed through a 
feeding tube for many months, and as a consequence, he 
refused to drink from a bottle. Eventually, he overcame his 
aversion to using his mouth to eat but only after substantial 
training. Nevertheless, despite our care. Tommy remained 
small for his age. 

The following summer. Tommy developed dark red marks on 
his cheeks and under his eyes. Our pediatrician referred us to 
a dermatologist, who suspected that the marks on Tommy's 
face were related to his growth failure. We were very sur¬ 
prised. How could these two findings be related? That is when 
we were told that Tommy might have a genetic disorder called 
Bloom syndrome. We hoped that the doctor was wrong, but 
soon thereafter Tommy had a genetic test that measured the 
number of sister chromatid exchanges per cell (see Chapter 2). 
This test confirmed that Tommy had Bloom syndrome. 
Although I insisted that it was a false-positive result, I learned 
to accept that our son had a very rare cancer syndrome. 

We were barraged by questions from family, friends, and 
doctors. As a result, we became very protective of our son and 
his privacy. Nevertheless, there was only so much we could 
do to protect him because he is such a social little boy, loving 


to play with family and friends. This also made choosing an 
appropriate elementary school a very difficult decision for us. 
We expected that children would pick on him because of his 
small size. However, to our surprise, he easily developed 
friendships and adjusted well to his classmates. In fact, the 
problems that he did develop were largely because of his 
misbehavior. Thus, we struggled to find a balance between 
protecting Tommy while not permitting him special privileges 
because of his small stature. 

In our home, we try to treat Tommy like any of our other 
children. One challenge for us Is that because of Tommy's 
small size, people wrongly perceive that he is much younger 
than his chronological age. This is very frustrating for Tommy, 
yet we occasionally reinforce this image because of our 
concern for his safety. For example, although Tommy is 6 years 
old, he weighs only 21 pounds. Thus, he must sit in an infant 
seat when he travels in a car and we explain to Tommy's 
friends that it helps him see through the windows. Another 
safety problem Is that many of the sensors for the automatic 
doors at supermarkets cannot detect his presence and easily 
slam Into him. 

Overall, Tommy has adapted well. He climbs or jumps in 
order to reach things. To keep up with his peers, he often runs, 
hops, or jumps instead of walking. We constantly worry about 
his safety, but we cannot control all that happens to him. To 
date, he has been healthy, and although It seems like we have 
been riding an emotional roller-coaster, we wouldn't trade our 
experiences for anything. 


established themselves as experts in the determination of 
recurrence risk, reproductive decision making, and psycho¬ 
social support (Box 15-4). In the prenatal and cancer genetics 
settings, genetic counselors function relatively independently 
as practitioners and can often bill for their services. 
More recently, genetic counselors have become important 
professionals on research teams and in genetics laboratory 
services. 

Genetic Advocacy Groups 

These support groups can provide critical assistance to fami¬ 
lies who have a member with a genetic disorder (see, for 
example, Genetic Alliance under Internet Resources at the 
end of this chapter). These organizations give the family the 
sense of a fellow traveler in a way that the professional is not 
able to do. The feeling of isolation that often accompanies 
genetic disorders (and rare conditions in general) is poten¬ 
tially alleviated by meeting someone else in the same situa¬ 
tion. Immediate bonds are formed that can assist in the 
coping process. In the last few decades, a clear partnership of 
professionals and persons with genetic disorders and disabili¬ 
ties has developed. Not only have these groups provided a 
needed service, but they also have promoted the establish¬ 
ment of databases and research studies. Referral to a genetic 
support group and distribution of their written information 
are now a routine part of the care and management of ah 
genetic disorders. 


The delivery of genetic services including 
genetic counseling involves a partnership of 
physicians, genetic counselors, and genetic 
advocacy groups. 

Clinical Genetics Evaluation and Services 

With the development of medical genetics as a medical spe¬ 
cialty, clinical genetics services have become part of the 
health-care delivery system. Most university medical centers 
in North America include a genetics clinic whose major 
objective is to provide genetic diagnosis, management, and 
counseling services. In recent years adult genetics services 
and hereditary cancer clinics have developed as part of 
this growth. 

As in all medical visits, evaluation of a person or family for 
a potential genetic condition requires a thorough history and 
physical examination. The history includes information about 
the family’s concerns, the prenatal period, labor, delivery (in 
the assessment of a child), and documentation of family rela¬ 
tionships (the pedigree). The physical examination should 
focus on the physical variations or minor anomalies that 
provide clues to a diagnosis. Additional family members might 
need evaluation for the presence or absence of a genetic disor¬ 
der. Photographs and recording of certain physical measure¬ 
ments are a standard component of the genetic evaluation of 
a person for the consideration of a syndrome. Ancillary tests 
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BOX 1 5-4 An Insider's View of Genetic Counseling 


What Is a Genetic Counselor? 

In its most general usage, the term genetic counselor refers 
to any medical professional who Is professionally qualified 
to provide genetic counseling. Typically, a genetic counselor is 
a genetics professional with a master's degree, with some 
genetic counselors having advanced degrees, generally in 
related fields. Degree programs in genetic counseling provide 
education and clinical training in medical genetics and counsel¬ 
ing. A certified genetic counselor has also passed a certifica¬ 
tion examination administered by the American Board of 
Genetic Counseling or the American Board of Medical 
Genetics. 

What Do Genetic Counselors Do? 

The majority of genetic counselors practice in a clinical setting, 
often working as part of a team that may include medical 
geneticists, other physicians (e.g., obstetricians, oncologists, 
neurologists, social workers, psychologists, nutritionists, 
nurses). Genetic counselors collect and assess medical infor¬ 
mation leading to a diagnosis; provide patient education; 
provide psychosocial support and counseling; provide genetic 
counseling and risk assessment for genetic/genomic testing; 
help physicians with the management of genetic/genomic con¬ 
ditions; and provide patient follow-up. They often triage inqui¬ 
ries and referrals to the service in which they practice. They 
might manage or coordinate clinics and personnel. They are 
active in genetic education programs for medical professionals 
and the lay public. 

In What Settings Do Genetic Counselors Work? 

Genetic counselors work in obstetrics settings, providing coun¬ 
seling for prenatal diagnosis and screening, genetic testing for 
couples with multiple pregnancy loss, diagnosis and manage¬ 
ment of pregnancies affected with abnormalities detected by 
radiological imaging, and alternative reproductive technologies. 
They work in cancer genetics clinics, providing genetic counsel¬ 
ing services for families with suspected or known cancer 
genetic syndromes. Genetic counselors also work In general 
genetics/genomics settings in pediatrics and adult medicine, 
as well as multidisciplinary specialty clinics for groups of 
diseases (e.g., metabolic, craniofacial, bone dysplasia, neuro- 
genetic, cardiovascular) or for single diseases (e.g., Down 
syndrome, neurofibromatosis, hemophilia). 

Many counselors participate In research related to clinical 
genetics/genomics and genetic counseling. For example, pre¬ 
dictive testing for disorders such as Huntington disease and 
hereditary cancers was initiated mainly In research studies 
aimed at assessing the medical, ethical, legal, and social 
consequences. More recently, genetic counselors have 
participated in clinical trials evaluating the transmission of infor¬ 
mation from tests such as whole exome sequencing, which 


provides information not just about the condition tested for, 
but information about many other "actionable" genetic traits. 
These studies are examining patient preferences and the con¬ 
sequences of providing this ancillary Information in a clinical 
setting. Within a research setting, genetic counselors provide 
counseling and help to design, implement, evaluate, and 
publish research. 

Many genetic counselors are employed by laboratories to 
provide an Interface between the laboratory and its clients 
(health-care practitioners, patients, or both), as part of the sales 
force, and to help develop laboratory protocols, policies, and 
educational materials. A small percentage of genetic counsel¬ 
ors are In private practice, and some work in administrative 
positions for the state or federal government or other agencies. 
Many genetic counselors are active at regional and national 
levels in professional organizations, and some counselors help 
to start, maintain, or advise lay advocacy groups for genetic 
disorders. 

What Skills and Personal Qualities Make a Good 
Genetic Counselor? 

A good genetic counselor needs both a strong background In 
the biological sciences and genetics and training in the theory 
and practice of psychosocial techniques (e.g., family systems, 
crisis counseling, interviewing skills, aid with decision making, 
health promotion). Because most genetic counselors provide 
direct patient service, it is essential to work well with people. 
Genetic counselors must work well both independently and on 
a team. There is a high level of responsibility Involved in direct 
and indirect patient care aspects and participation in policy 
decisions, and counselors must successfully learn to handle 
the stress of the difficult situations of the families and agencies 
with whom they work. 

What Is the Future of Genetic Counseling? 

In the last 10 years, clinical genetics/genomics has moved 
solidly Into mainstream medicine. Genetic counselors continue 
to increase in number, but the rapid increase of genetic/ 
genomic tests into all aspects of medicine also requires genetic 
counselors and other geneticists to participate in education and 
support for other health-care providers. As genomic testing 
becomes the norm, genetic counselors will help develop and 
carry out the techniques for clinical interpretation of the vast 
amount of personal data generated. The National Society of 
Genetic Counselors is currently examining workforce issues 
and the Association of Genetic Counseling Program Directors 
is examining advanced training options for certified genetic 
counselors. What is undisputed is the striking emergence of 
medical genetics from an obscure medical subspeclalty to an 
area of knowledge that is fast becoming integrated into every 
field of medicine. 


(Courtesy Bonnie J. Baty, M.S.) 


may be required to document specific physical features (e.g., 
an echocardiogram or MRI for aortic dilatation in Marfan 
syndrome or radiographs to diagnose achondroplasia). 

One important component of clinical data gathered in this 
process is the family history (Box 15-5). The data obtained in 


a family history are often useful in obtaining an accurate 
diagnosis of a condition. For example, a strong family history 
of early-onset coronary disease might indicate the presence 
of a low-density lipoprotein-receptor defect causing familial 
hypercholesterolemia. A family history of early-onset colon 
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BOX 1 5-5 The Family History 


A thorough, accurate family history is an indispensable part of 
a medical evaluation, and a pedigree should be part of the 
patient's chart. At a minimum, the following items should be 
included. 

• The sex of each individual and his or her relationship to other 
family members. This information should be indicated using 
standard pedigree symbols (see Chapter 4). 

• A three-generation family history should be obtained. For 
example, male relatives on the mother's side of the family 
will be especially important when considering an X-linked 
recessive disorder. 

• The age of each individual. A record must be kept of whether 
each individual is affected with the disease in question, and 
inquiries should be made about diseases that may be related 


to the disease in question (e.g., ovarian cancer in a family 
being seen for familial breast cancer). 

• All known miscarriages and stillbirths. 

• The ethnic origin of the family. This is important because 
many diseases vary considerably in prevalence among dif¬ 
ferent ethnic groups. 

• Information about consanguinity. Although it is relatively rare in 
most Western populations, consanguinity is common in many 
of the world's populations, and immigrant populations often 
maintain relatively high rates of consanguinity (see Chapter 4). 

• Changes in family histories. Family members develop newly 
diagnosed diseases, and additional children are born. These 
changes can affect diagnosis and risk estimation, so the 
family history and pedigree should be updated periodically. 


(See Guidelines in Suggested Readings, Bennett et al.) 


BOX 1 5-6 Types of Clinical Genetic Services and Programs 


Center-based genetics clinics 
Outreach clinics 
Inpatient consultations 
Specialty clinics 

• Metabolic clinics 

• Cancer genetics clinics 

• Spina bifida clinics 

• Hemophilia clinics 

• Craniofacial clinics 

• Other single-disorder clinics (e.g., NF1 clinics) 

Prenatal diagnosis programs: Perinatal and reproductive genet¬ 
ics clinics 

• Amniocentesis and chorionic villus sampling clinics 

• Ultrasound programs 


• Maternal serum screening programs (see Chapter 13) 

• Preimplantation diagnosis programs 

• Presymptomatic diagnosis in families (e.g., familial breast 
cancer diagnosis) 

Genetic screening 

• Newborn screening program/follow-up clinics 

• Other population screening programs (e.g., Tay-Sachs 
disease) 

Education and training 

• Health-care professionals, including clinical geneticists and 
genetic counselors 

• General public 

• School system 

• Teratology information services 


cancer could indicate that a gene for familial adenomatous 
polyposis or hereditary nonpolyposis colorectal cancer is 
present in the family. An affected sibling with congenital sen¬ 
sorineural hearing loss suggests the presence of autosomal 
recessive deafness. Family history information can also guide 
the estimation of recurrence risks by helping to determine 
whether a genetic disease has been transmitted by one’s 
parents or has occurred as a new mutation (this is especially 
important for diseases with reduced penetrance). The knowl¬ 
edge and skills required to take an accurate and thorough 
family history are important for all clinicians, not only clini¬ 
cal geneticists. 

Routinely, the clinician sends the family a letter summariz¬ 
ing the diagnosis, natural history, and risk information 
regarding the condition. This letter is a valuable resource for 
the family, because it helps to document the risk information 
for later review. Information regarding lay advocacy groups, 
including the group’s webpage, social network contact infor¬ 
mation, pamphlets, and booklets, is often provided. Follow-up 
visits are recommended depending on the individual situa¬ 
tion. Box 15-6 provides a list of clinical genetics services. 


Clinical genetic evaluations include physical 
examination, detailed family history, ancillary 
tests as needed, and communication of infor¬ 
mation to the family through letters and the 
distribution of published literature. 

The care of persons with genetic disease includes the 
development of guidelines for follow-up and routine care. 
Knowledge of the natural history of a condition, coupled with 
a critical review of screening tests and interventions, can 
provide a framework for health supervision and anticipatory 
guidance. The management plan can subsequently be used 
by the primary care provider. It is primarily for this purpose 
that many of the specialized clinics, such as those for NFl 
or hemophilia, have been established. An example of this 
approach is the management checklist for the health mainte¬ 
nance of infants and children with Down syndrome (see 
Chapter 6). Gene Reviews, a highly important resource for 
practitioners in any setting, provides these guidelines for all 
of the genetic conditions included in their database (see 
Internet Resources below). 
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BOX 1 5-7 Common Indications for 
Genetics Referral 


Evaluation of a person with cognitive disability or develop¬ 
mental delay 

Evaluation of a person with single or multiple malformations: 

question of a dysmorphic syndrome 
Evaluation of a person with a possible inherited metabolic 
disease 

Presence of a possible single-gene disorder 
Presence of a chromosomal disorder, including balanced 
rearrangements 

Person at risk for a genetic condition, including questions of 
presymptomatic diagnosis or cancer risk 
Person or family with questions about the genetic aspects of 
any medical condition 

Couples with a history of recurrent miscarriages 
Consanguinity in a couple, usually first cousin or closer 
relationship 
Teratogen counseling 

Preconceptlonal counseling and risk-factor counseling, includ¬ 
ing advanced maternal age and other potential indications 
for prenatal diagnosis 


As treatment options for mendelian disorders become 
more numerous (e.g., the treatment of aortic dilatation in 
Marfan syndrome; see Chapter 4), the role of clinical geneti¬ 
cists will likely change. Since the turn of the 21st century, 
geneticists have become increasingly involved in the design 
and implementation of clinical trials, and this trend will cer¬ 
tainly continue and change the nature of the practice. 

Traditionally, genetic counseling involves the family who 
comes in with questions about the diagnosis, management, 
and recurrence risk of the condition in question. Thus, in the 
majority of situations, genetic counseling is carried out retro¬ 
spectively. With the increased availability of prenatal, carrier, 
and presymptomatic testing, and the expected rise in the 
personalized health-care model (see Chapter 16), prospective 
genetic counseling will become more common. Box 15-7 lists 
common reasons for referral for genetic evaluation. 

DYSMORPHOLOGY AND 
CLINICAL TERATOLOGY 

Dysmorphology was defined at the beginning of this chapter 
as the study of abnormal physical development (morphogen¬ 
esis). Congenital defects are caused by altered morphogene¬ 
sis. Although the term dysmorphology may seem synonymous 
with teratology, the latter term usually implies the study 
of the environmental causes of congenital anomalies, 
even though its literal meaning does not refer to etiology. 
(The term teratology is derived from teras, the Greek word 
for “monster.” The term dysmorphology was proposed by 
Dr. David Smith as a reaction to the pejorative connotation 
of teratology.) 

Congenital defects represent an important cause of infant 
mortality and morbidity. Current studies indicate that the 


TABLE 1 5-1 Examples of Common 
Multiple Congenital Anomaly and 
Dysplasia Syndromes Referred to a General 
Medical Genetics Clinic 


SYNDROMES 

ETIOLOGY 

Down syndrome 

Chromosomal 

Neurofibromatosis 

Single gene (AD) 

type 1 

Angelman 

Microdeletion of chromosome 

syndrome 

15q: uniparental disomy 

Amnion disruption 

Unknown 

sequence 

Osteogenesis 

Single gene: heterogeneous. 

imperfecta 

AD, AR: type 1 collagen, and 
related genes 

Trisomy 18 

Chromosomal 

VATER association 

Unknown 

Marfan syndrome 

Single gene (AD) 

Prader-Willi 

Microdeletion of chromosome 

syndrome 

15q: uniparental disomy 

Noonan syndrome 

Single gene (AD) 

Williams syndrome 

Microdeletion of chromosome 7 

Achondroplasia 

Single gene (AD) 

Trisomy 13 

Chromosomal 

Turner syndrome 

Chromosomal (45,X) 

Rett syndrome 

X-linked gene 

Rubinstein-Taybi 

Single gene (AD) 

syndrome 

Klippel-Trenaunay 

Heterogeneous: one 
susceptibility syndrome gene 
identified 

Fetal alcohol 

Excessive alcohol 

syndrome 

Cornelia de Lange 

Single gene (AD) 

syndrome 


AD, Autosomal dominant, AR, autosomal recessive: VATER, 
vertebral anomalies, anal atresia, tracheoesophageal fistula, 
esophageal atresia, renal anomalies. 


frequency of medically significant malformations diagnosed 
in the newborn period is about 2%. Investigations that 
have observed children for a longer period demonstrate that 
this frequency increases to about 3% by the age of 1 year. In 
the United States, congenital malformations represent the 
most common cause of mortality during the first year of life. 
Table 15-1 lists some of the most common and important 
malformation syndromes. 

There are several ways to classify congenital abnormalities. 
The most common classification approach is by organ system 
or body region (e.g., craniofacial, limb, heart). More clinically 
useful classification schemes include (1) single defect versus 
multiple congenital anomalies, including a recognizable syn¬ 
drome, (2) major (medically or surgically significant defects) 
versus minor anomalies, (3) categorization by pathogenic 
process, and (4) etiological classification. 

Regarding etiologic classification, there has been only one 
investigation in North America that examined the causes of 
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the malformations in a cohort of infants with congenital 
defects (see Nelson and Holmes on the Readings list). These 
authors list the causes of the various defects in their study 
conducted in Boston. Three key messages emerged from these 
data: the cause of two thirds of the congenital defects is either 
unknown or falls into the multifactorial category (see Chapter 
12); well-estahlished environmental causes of congenital 
malformations are infrequent (less than 4%); and a known 
genetic component is identified in approximately 30% of 
cases. 

Principles of Dysmorphology 

In discussing the basic principles of dysmorphology, it is 
important to define certain key terms. The following defini¬ 
tions, based on pathogenic processes, are used in clinical 
practice: 

• Malformation is a congenital morphologic anomaly of a 
single organ or body part due to an alteration of a primary 
developmental program (e.g., cleft lip, polydactyly). 

• Dysplasia is a morphologic anomaly involving a dynamic 
or ongoing alteration of cellular constitution or tissue 
organization within a specific organ or tissue type (e.g., 
ectodermal dysplasia). 

• Sequence is one or more secondary morphologic anoma¬ 
lies known or presumed to cascade from a single malfor¬ 
mation or dysplasia (e.g., Pierre Robin sequence, a disorder 
in which a primary defect in mandibular development 
produces secondarily a small jaw, glossoptosis, and a cleft 
palate). 

• Syndrome is a pattern of multiple primary malformations 
known or thought to be causally related (e.g., trisomy 13 
syndrome). 

• Deformation is alteration of the shape or position of a 
normally formed body part due to aberrant mechanical 
forces. It usually occurs in the fetal period, not in 
embryogenesis. A deformation is a secondary alteration 
and can be extrinsic, as in oligohydramnios (reduced 
amniotic fluid), or intrinsic, as in congenital myotonic 
dystrophy. 

• Disruption is a congenital morphological defect of an 
organ, part of an organ, or a larger region of the body 
resulting from the breakdown of a body structure that had 
a normal developmental potential. It is by definition a 
secondary malformation (e.g., secondary limb defect 
resulting from a vascular event). 

Note that malformations and dysplasias are morphologic 
anomalies whose primary events occur in embryogenesis and 
histogenesis, whereas disruptions and deformations are by 
definition secondary. All of these terms were recently updated 
and clarified by an international working group called the 
Elements of Morphology; see the Readings at the end of the 
chapter. 

When evaluating a child with a congenital malformation, 
the most important question is whether the abnormality rep¬ 
resents a single, isolated anomaly or is instead one compo¬ 
nent of a broader, organized pattern of malformation (i.e., a 
syndrome). This concept is exemplified by the evaluation of 


a baby with a cleft lip. If a baby has an isolated, nonsyndromic 
cleft lip with no other malformations, the discussion of 
natural history, genetics, prognosis, and management is 
markedly different than if the baby’s cleft lip is one feature of 
the trisomy 13 syndrome (see Chapter 6). The former condi¬ 
tion can be repaired surgically and has a relatively low 
recurrence risk (see Chapter 12) and few associated medical 
problems. Trisomy 13 is a serious chromosomal disorder. In 
addition to orofacial clefts, these infants usually have a con¬ 
genital heart defect and central nervous system malforma¬ 
tions. More importantly, 50% of children with trisomy 13 die 
in the newborn period, and 90% die before 1 year of age. 
Thus, prediction of the natural history and medical manage¬ 
ment in these two examples are quite different. 

Another example is a child with cleft lip who also has pits 
or fistulas of the lower lip. The combination of orofacial clefts 
and lip pits signifies an autosomal dominant condition called 
the van der Woude syndrome. Although the natural history 
of this condition differs little from that of nonsyndromic cleft 
lip, the discussion of genetic recurrence risks is much differ¬ 
ent. In the evaluation of a child with van der Woude syn¬ 
drome, it is very important to determine whether one of the 
parents carries the gene. If so, the sibling recurrence risk is 
50%. This is much greater than the 4% sibling recurrence risk 
usually given for nonsyndromic cleft lip (see Chapter 12). 
Because van der Woude syndrome has variable expression 
and often is manifested only by lip fistulas, it is quite com¬ 
monly overlooked. Thus, a careful physical examination, 
combined with a knowledge of the genetics of isolated mal¬ 
formations and syndromes, is necessary to determine accu¬ 
rate recurrence risks. 

The most important question to ask when eval¬ 
uating a child with a congenital malformation 
is whether the defect is isolated or part of a 
syndrome pattern. 

Increasing knowledge of the pathogenesis of human con¬ 
genital defects has led to a better understanding of the devel¬ 
opmental relationship of the defects in multiple congenital 
anomaly patterns. Some well-established conditions that 
appear to be true syndromes at first glance are really a con¬ 
stellation of defects consisting of a primary malformation 
with its secondary, localized effects (i.e., a sequence). In a 
sequence, the pattern is a developmental unit in which the 
cascade of secondary pathogenic events is well understood. 
In contrast, the pathogenic relationship of the primary mal¬ 
formations in a syndrome is not as well understood, although 
pathogenesis may be clarified when the syndrome is the result 
of the pleiotropic effects of a single gene (e.g., Marfan syn¬ 
drome; see Chapter 4). 

One of the best examples of a sequence is the Potter phe¬ 
notype or oligohydramnios sequence. It is currently believed 
that any significant and persistent condition leading to 
oligohydramnios can produce this sequence, whether it be 
intrauterine renal failure due to kidney malformations (such 
as missing kidneys, renal agenesis) or chronic leakage of 
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ETIOLOGY PATHOGENESIS PHENOTYPE 



FIG 15-3 The oligohydramnios sequence. Oligohydramnios 
can arise from a number of distinct causes. It produces a 
constellation of secondary phenotypic features. 


amniotic fluid. The fetus will develop a pattern of secondary 
growth deficiency, joint contractures (deformations), charac¬ 
teristic facial features, and pulmonary hypoplasia (Fig. 15-3). 
Before the cause of these features was understood, the 
phenotype was termed the Potter syndrome. Now, with the 
understanding that all of the features are secondary to oligo¬ 
hydramnios, the disorder is more properly called the oligohy¬ 
dramnios sequence. As with any malformation, the renal 
defect can occur by itself, or it can be a part of any number 
of syndromes in which renal malformations are component 
features (such as autosomal recessive Meckel-Gruber syn¬ 
drome or the more common nonsyndromic multifactorially 
caused disorder, bilateral renal agenesis). Distinguishing 
between syndromes and sequences can often improve under¬ 
standing of the underlying cause of a disorder and aid in 
prediction of prognosis. 

It is important to distinguish between a 
sequence, which is a primary defect with sec¬ 
ondary structural changes, and a syndrome, 
which is a collection of malformations whose 
relationship to one another tends to be less 
well understood. 

Clinical Teratology 

A teratogen is an agent external to the fetus’s genome that 
induces structural malformations, growth deficiency, and/or 
functional alterations during prenatal development. Although 
teratogens cause only a small percentage of all congenital 
defects, the preventive potential alone makes them worthy of 
study. Table 15-2 lists the well-established human teratogens 
due to medications or drugs. Other categories of agents 
include infectious diseases during pregnancy (e.g., cytome¬ 
galic virus), maternal disease states (e.g., maternal diabetes 
mellitus), and ionizing radiation. 


It is important to understand the reasoning process that 
leads to the designation of a substance or exposure as a 
teratogen. This process is based on an evaluation of epide¬ 
miological, clinical, biochemical, and physiological evidence. 
Animal studies can also help to establish whether an agent is 
teratogenic and provide insight into pathogenesis. 

The key issues involved in determining whether an agent is 
teratogenic are summarized in Clinical Commentary 15-4. A 
key clinical point is that it is common for families to ask their 
health-care professionals questions about the risks of certain 
exposures during pregnancy. When faced with such a question, 
the clinician has a number of options. One is to review the 
literature on particular exposures in humans, make a judgment 
regarding the degree of risk, and then provide counseling. 
Various online and printed sources available to the clinician are 
listed at the end of the chapter. An alternative is to refer the 
patient to a teratology information service or to a clinical 
genetics unit. Because of the complexity of these issues, teratol¬ 
ogy information services have sprung up throughout the 
United States, Canada, and Europe. The Organization of Tera¬ 
tology Information Services (OTIS) provides current and com¬ 
prehensive information on exposures in pregnancy, lists the 
available teratology information services in North America, 
and provides connections to a regional resource. 

Teratogens are external agents that cause a 
small but important percentage of congenital 
malformations. It is often difficult to prove con¬ 
clusively that a substance is teratogenic. 

Prevention of Congenital Malformations 

Because the majority of structural defects have no obvious 
cause, their prevention presents a challenge. An example is 
fetal alcohol syndrome (FAS), one of the more common pre¬ 
ventable causes of human malformation (Clinical Commen¬ 
tary 15-5). The institution of rubella immunization programs 
and the preconceptional administration of folic acid exem¬ 
plify successful prevention (Clinical Commentary 15-6). 

Preconceptional counseling is a model for primary pre¬ 
vention. Women who have diabetes mellitus, phenylketon¬ 
uria, or systemic lupus erythematosus (an autoimmune 
disorder involving production of autoantibodies that affect 
multiple organs) can decrease their risk of producing an 
infant with a structural defect through appropriate precon¬ 
ceptional management, a primary prevention strategy. Exam¬ 
ples of secondary and tertiary levels of prevention, respectively, 
include newborn screening for hearing loss and the provision 
of high-quality medical care for infants and children with 
congenital malformations. Instituting appropriate guidelines 
for health supervision and anticipatory guidance can decrease 
some of the complications of these disorders. Also important 
is public education regarding the limitations of scientific 
knowledge and the emotional difficulties of families in which 
a child has a congenital defect. Such information has the 
potential to diminish anxiety, improve the family’s coping 
process, and reduce the stigma that surrounds congenital 
malformations and genetic disorders. 
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TABLE 15-2 Well-Established Human Teratogens 



DRUG 

POTENTIAL DEFECT 

CRITICAL 

EXPOSURE 

PERIOD 

PERCENTAGE OF 
THOSE EXPOSED 
WHO ARE AFFECTED 

Angiotensin-converting 

Renal dysgenesis: oligohydramnios: skull ossification 

Second to third 

NE 

enzyme (ACE) inhibitors 

defects 

trimester 


Alcohol, chronic 

Craniofacial and central nervous system anomalies: 
heart defects 

<12 wk 

10-15 


Low birth weight: developmental delay 

>24 wk 

NE 

Aminopterin 

Spontaneous abortion 

<14 wk 

NE 


Craniofacial anomalies; limb defects: 
craniosynostosis: neural tube defects 

First trimester 

NE 


Low birth weight 

>20 wk 

NE 

High-dose androgens or 

Masculinization of external female genitalia 

>10 wk 

0.3 

norprogesterones 

Carbamazepine 

Spina bifida 

<30 days after 
conception 

»1 

Carbimazole/methimazole 

Hypothyroidism: goiter. 

Choanal atresia, scalp defects, other anomalies? 

NE 

NE 

Cocaine 

Abruptio placentae 

Second to third 
trimester 

NE 


Intracranial hemorrhage: premature labor and delivery 

Third trimester 

NE 

Diethylstilbestrol 

Uterine abnormalities: vaginal adenosis; vaginal 
adenocarcinoma: cervical ridges; male infertility 

<12 wk 

NE 

Fluconazole (high-dose) 

Limb and craniofacial defects 

First trimester 

NE 

Isotretinoin 

Fetal death; hydrocephalus: central nervous system 
defects; microtia or anotia; small or missing 
thymus; conotruncal heart defects: micrognathia 

>15 days after 
conception 

45-50 

Methotrexate 

Craniosynostosis: underossified skull; craniofacial 
dysmorphology; limb defects 

6-9 wk after 
conception 

NE 

Mycophenolate mofetil 

Ear malformation, orofacial clefts, and heart defects 

First trimester 

NE 

Penicillamine 

Cutis laxa, joint contractures 

NE 

NE 

Phenytoin 

Craniofacial anomalies; hypoplastic phalanges and 
nails 

First trimester 

10-30 

Solvents, abuse (entire 

Small size for gestational age; developmental delay 


NE 

pregnancy) 

Streptomycin 

Hearing loss 

Third trimester 

NE 

Tetracycline 

Stained teeth and bone 

>20 wk 

NE 

Thalidomide 

Limb deficiencies; ear anomalies 

38-50 days 
post-LMP 

15-25 

Thiouracil 

Spontaneous abortion 

First trimester 

NE 


Stillbirth 

Goiter 

>20 wk 

NE 

NE 

Trimethadione 

Developmental delay; V-shaped eyebrows; low-set 
ears: irregular teeth 

First trimester 

NE 

Valproic acid 

Spina bifida 

<30 days after 
conception 

«1-2 


Craniofacial anomalies: preaxial defects 

First trimester 

NE 

Warfarin 

Nasal hypoplasia: stippled epiphyses 

6-9 wk 

NE 


Central nervous system defects secondary to 
cerebral hemorrhage 

>12 wk 

NE 


Data from Martinez LP, Robertson J, Leen-Mitchell M: Environmental causes of birth defects. In: Rudolph CD, Rudolph AM, Hostetter MK, 
Lister G, Siegel NM, eds: Rudolph's Pediatrics. 21st ed. New York: McGraw-Hill: 2002:9. 

NE, Not established: post-LMP, after last menstrual period. 

*Other established teratogens include maternal infections (rubella, cytomegalovirus, toxoplasmosis, varicella, Venezuela equine encephalitis, 
syphilis, parvovirus), maternal disease states (diabetes mellitus, phenylketonuria, systemic lupus erythematosus. Graves' disease), and 
ionizing radiation. 









CLINICAL COMMENTARY 15-4 
The Bendectin Saga 

Bendectin, or doxyclamine, was an agent introduced in the 
1960s for "morning sickness" (nausea and vomiting during 
pregnancy). The agent was particularly efficacious, and during 
the 1970s about one third of American women took Bendectin 
at some time during their first trimester of pregnancy. 

Bendectin has probably been studied more than any other 
individual pregnancy medication. Several epidemiological 
studies have found no conclusive evidence of an increased risk 
of congenital malformations with use of Bendectin during preg¬ 
nancy. The few studies that have demonstrated weak associa¬ 
tions between Bendectin and birth defects did not reveal any 
consistent patterns. Animal studies also indicate no associa¬ 
tion. Despite these data, a number of lawsuits were filed in the 
1980s against the company that marketed Bendectin. As a 
result, the company removed the drug from the market. 

The reasoning process used to determine causation in such 
cases is complex. Before judgments about etiology are made, 
a critical review of the literature is required. The available epi¬ 
demiological studies must be evaluated in terms of their meth¬ 
odology, design, and biases. Up-to-date knowledge of the 
etiology and pathogenesis of congenital malformations must 
be included in the design of the study. The basic principles of 
teratology must be applied. This includes an evaluation of the 
critical period (i.e., did exposure occur during the period of 
pregnancy in which the malformed fetal structures were devel¬ 
oping?). Clinical evidence includes the search for a pattern of 
defects (i.e., a specific syndrome), because all well-established 
teratogens produce consistent patterns. Animal models never 
prove causation in humans but can provide supportive evi¬ 
dence and information about pathogenesis. In addition, the 
proposed effect of the agent should be biologically plausible. 


A review of the evidence regarding Bendectin shows that it 
meets none of these criteria. Indeed, because of its extensive 
testing, Bendectin satisfies standard criteria for safety as well 
as any known medication. What, then, could have produced 
this litigation? An important part of the answer to this question 
involves the coincidental occurrence of congenital malforma¬ 
tions and Bendectin exposure. Considering that a major con¬ 
genital malformation will be diagnosed in 3% of infants by 1 
year of age and that about one third of women were taking 
Bendectin in the first trimester of pregnancy, then about 1% 
(1/3 X 3%) of all pregnancies in the 1970s would have experi¬ 
enced the co-occurrence of these two events by chance alone. 
Because about two thirds of congenital malformations have no 
known cause, it is not surprising that many families of children 
with these disorders (and their lawyers) would attribute them 
to Bendectin use. 

Bendectin was removed from the market in 1983. Since then, 
the percentage of babies born with congenital malformations 
has remained the same, and the number of women admitted 
to hospitals for complaints of morning sickness has doubled. 

It is very difficult to prove epidemiologically that any exposure 
is "safe." The statistical power of such studies does not permit 
an absolute statement that there is no effect. All that can be 
demonstrated is that there is no definitive evidence that a particu¬ 
lar agent (in this case, Bendectin) causes an adverse outcome. 
Blanket reassurance or absolute statements of complete safety 
are not appropriate. On the other hand, when the evidence is rela¬ 
tively conclusive, as in the case of Bendectin, it is clinically appro¬ 
priate to have a reassuring tone in discussing exposure in the 
pregnancy setting. Notably the generic form of Bendectin has 
recently been rereleased onto the Canadian markets. 


CLINICAL COMMENTARY 15-5 
Fetal Alcohol Syndrome 

Among the human teratogens, one of the most common and 
potentially preventable exposures is excessive alcohol con¬ 
sumption. Women who are chronic alcoholics are at significant 
risk of bearing a child with the fetal alcohol syndrome (FAS). 
This condition consists of prenatal and postnatal growth defi¬ 
ciency, microcephaly (small head), a wide range of develop¬ 
mental disabilities, and a constellation of facial alterations (Fig. 
15-4). The more distinctive and consistent facial features 
include short palpebral fissures, underdeveloped philtral folds, 
and a thin upper lip vermilion. (See the Elements of Morphology 
reference at the end of the chapter for the precise definition 
of these terms.) Although most of these signs are not specific 
for FAS, their mutual occurrence in the context of maternal 
alcohol abuse allows the clinician to make the diagnosis. 

In addition to these findings, infants and children with FAS 
are at risk for a number of structural defects, including congeni¬ 
tal heart malformations, neural tube defects, and renal anoma¬ 
lies. Most children with FAS have a mild degree of developmental 
delay, ranging from mild cognitive disability to learning difficul¬ 
ties. Additionally many children with FAS have significant 
behavioral problems. 

There are still many unanswered questions regarding alcohol 
use in pregnancy. These include the extent of genetic predis¬ 
position to FAS, the risk of binge drinking, the role of moderate 
and social drinking, and the safe level of alcohol in pregnancy. 
Although there is no conclusive evidence that light drinking in 
early pregnancy is harmful, avoidance of alcohol during preg¬ 
nancy is the most prudent approach. 



FIG 15-4 A 2-year-old child with fetal alcohol syndrome. 
Note the depressed nasal root, short palpebral fissures, 
smooth philtrum, and thin upper lip vermilion. 
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CLINICAL COMMENTARY 15-6 

Folate and the Prevention of Neural Tube Defects 

The primary prevention of congenital malformations is an 
important goal of clinical genetics. Because the ultimate cause 
of most congenital malformations is currently unknown, there 
are relatively few opportunities for primary prevention. A recent 
approach to the prevention of congenital malformations is the 
periconceptional use of folate and multivitamin regimens to 
prevent the occurrence and recurrence of neural tube defects 
(NTDs). 

NTDs consist of malformations of the developing neural 
tube and express themselves as anencephaly, encephalocele, 
and spina bifida (see Chapter 12). Their impact is serious: 
Anencephaly is invariably fatal, and the medical complications 
of spina bifida (lower limb paralysis, hydrocephalus, urinary 
obstruction) are significant. Because of the potential influence 
of nutritional elements on embryogenesis, a series of epide¬ 
miological studies were performed in the 1970s and 1980s. 
With one exception, they demonstrated that the use of vita¬ 
mins and folate in the periconception period reduced the 
recurrence risk of spina bifida and anencephaly in families that 
had previously had a child with one of these conditions. In 
1991 the Medical Research Council of the United Kingdom 
published a double-blind study* in which 4 mg of folate with 
or without vitamins was administered to women who had 
had a child with an NTD. The group receiving folate alone 
experienced a 70% reduction in the recurrence of these 
malformations in their offspring. In 1992, a Hungarian group 
demonstrated the usefulness of vitamins and folic acid in 


preventing the initial occurrence of NTDs. In this study, two 
groups of women, one receiving vitamins and folic acid and 
the other not, were followed for the duration of their pregnan¬ 
cies. The vitamin-and-folic acid regimen significantly decreased 
the occurrence of NTDs. Numerous additional studies have 
confirmed these results. 

Although it is still not clear whether the purported protective 
effect is due to folic acid or a combination of folic acid and 
other vitamins, these data indicate that periconceptional 
vitamin use is an effective prevention strategy. The mechanism 
for the apparent effect remains unknown. Nonetheless, the 
encouraging results of these studies have prompted the 
Centers for Disease Control and Prevention to publish two 
recommendations regarding the use of folate. The first is that 
all women who have previously had a child with an NTD should 
take 4 mg/day of folic acid if they are planning to become 
pregnant. The second recommendation is that all women of 
reproductive age should take 0.4 mg/day of folic acid (the 
amount available in a typical multivitamin tablet) throughout 
their reproductive years. The latter recommendation is prudent 
in light of the fact that approximately half of pregnancies in the 
United States are unplanned. These recommendations have 
led to the folic acid fortification of wheat and other grain prod¬ 
ucts in the United States and other nations. In the past decade, 
studies in a variety of countries throughout the world have 
demonstrated a decrease in the occurrence of NTDs after the 
initiation of a food fortification program. 


*A double-blind study is one in which, during the treatment phase of the study, neither the subjects nor the investigator knows which 
subjects are receiving an active ingredient and which are receiving placebo. 


BIOETHICS AND MEDICAL GENETICS 

With new discoveries and advances in medical technology 
come new choices for patients, families, and society. As 
medical genetics came to be defined as a medical specialty 
during the past several decades, a number of new issues in 
bioethics also emerged. Because of the significance and com¬ 
plexity of these issues, a significant portion of the budget of 
the Human Genome Project was devoted to the ethical, legal, 
and social implications of human genetics. Some of these 
implications have been touched upon previously in this text¬ 
book (e.g., genetic testing, gene therapy, embryonic stem cell 
research). Our goal here is to provide a sampling of the major 
ethical questions currently confronting the medical and 
genetics communities. 

The combination of advances in prenatal diagnosis (e.g., 
ultrasound, amniocentesis), the ability to determine the 
human karyotype, and the option of pregnancy termination 
set the stage for the rise of prenatal diagnosis as a clinical 
service in the 1970s. By that time, most tertiary medical 
centers in developed nations offered amniocentesis for 
various indications, most commonly advanced maternal age 
(see Chapter 13). In the early discussions of the ethics of 
prenatal diagnosis, the central controversy involved a woman’s 
(or couple’s) right to terminate a pregnancy. In the 1990s, the 


issue took on a new dimension with the concern that persons 
with disabilities may be devalued by society when prenatal 
diagnosis can lead to the selective termination of fetuses with 
a disability. Similar concerns revolve around the issue of with¬ 
drawal of support for newborns with serious birth defects 
(e.g., trisomy 13, some neural tube defects). The main prin¬ 
ciples guiding decisions about these issues are to consider the 
best interests of the child and to provide genetic counseling 
so that the parents can make an informed decision. 

Ethical concerns have arisen about other types of genetic 
testing, including carrier testing and presymptomatic testing 
(see Chapter 13). Genetic tests differ from other types of 
medical testing because genes (including mutations that pre¬ 
dispose to disease) can be shared in families. Thus, a genetic 
test performed in one person might reveal risk information 
about a relative who might not wish to know about his or her 
risk (e.g., testing of a young adult for an autosomal dominant 
disease might indicate that one of the parents must have 
transmitted the disease-causing mutation). In addition, many 
people perceive that their genetic inheritance is very much an 
intrinsic part of themselves (and their families). Genetic risk 
is often misperceived as “unchangeable.” All of these factors 
can lead to unfair stigmatization of individuals, families, and 
even whole populations. To help counter this, care providers 
must be sensitive to the needs and concerns of individuals 
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and their families. They should avoid making value judg¬ 
ments that could lead to or reinforce stigmatization. They 
should dispel notions of genetic determinism, making it clear 
to families that genes are but one part of the cause of a 
disease. Nongenetic factors, which often can be altered, can 
also play an important role. As with all medical information, 
privacy and confidentiality must be respected. 

Genetic testing also raises the specter of discrimination by 
insurance companies or employers. Insurance companies have 
long collected information about family history as a means of 
assessing risk. In some cases (e.g., an individual at risk for 
inheriting a BRCAl mutation), a genetic test can provide a 
much more accurate measure of disease risk. At-risk persons 
are understandably concerned about the possibility of losing 
their insurance benefits or their employment because of the 
outcome of a genetic test. A paradoxical result is that some 
choose not to be tested, even when it could lead to a potentially 
life-saving intervention. Insurance providers and employers 
reason that denying coverage (or increasing premiums) to 
at-risk individuals serves the general interest by minimizing 
costs. Others respond that, unlike lifestyle choices such as ciga¬ 
rette smoking or exercise, one does not choose one’s genes, so 
it is unfair to discriminate on the basis of genetic tests. 

Because of concerns in the United States regarding dis¬ 
crimination in employment and health insurance, the genetics 
community and supportive legislators worked together during 
the early 2000s to enact legislation to ensure the confidentiality 
of genetic test results. The Genetic Information Nondiscrimi¬ 
nation Act (GINA) was formulated to prevent the discrimina¬ 
tory use of genetic test results by employers or health insurers. 
GINA became a federal law in 2008 and took effect in 2009. 

Preimplantation genetic testing (see Chapter 13) has also 
been the focus of ethical debates. For example, this form of 
testing can be used to determine the sex of an embryo. Indeed, 
an early application was to avoid implanting male embryos 
that had an increased risk of carrying an X-linked recessive 
mutation. Many scientists and ethicists believe that preim¬ 
plantation diagnosis solely for sex selection is inappropriate, 
and the practice is currently banned in the United Kingdom. 
Preimplantation diagnosis could also be used to select pref¬ 
erentially for embryos that do carry disease-causing muta¬ 
tions. For example, an embryo that is homozygous for 
mutations causing autosomal recessive deafness could be 
selected to match the phenotypes of deaf parents (a case was 
reported in which deaf parents deliberately conceived a deaf 
child by artificial insemination). Such applications might put 
the interests of the parents and the interests of the child in 
conflict. Another controversial application of preimplanta¬ 
tion diagnosis was in the selection of a human leukocyte 
antigen (FILA)-matched embryo that could later provide 
bone marrow cells for an older sibling who had Fanconi 
anemia. It is argued that such persons could feel that their life 
was devalued because they had been selected partly on the 
basis of their suitability as a bone marrow donor. 

Questions have arisen about the genetic testing of children. 
When such testing can lead to useful diagnostic measures or 
interventions, it may be warranted. An example would be the 


genetic testing of children at risk for inheriting a mutation in 
the adenomatous polyposis coli (APC) gene. As discussed in 
Chapter 11, colonoscopy should begin in gene carriers by age 
12 years and is potentially life-saving. In contrast, childhood 
diagnosis of Huntington disease currently provides no preven¬ 
tive or therapeutic benefits and increases the potential for 
anxiety and stigmatization. A consensus has emerged that 
childhood genetic testing should not be pursued unless it pro¬ 
vides an avenue to clinically beneficial intervention. 

With the advent of whole exome and genome sequencing 
in the clinical arena (see Chapter 13), significant controversy 
arose in the field surrounding the reporting of incidental (or 
secondary) findings that were commonly detected in the 
testing process. These findings represent variations unrelated 
to the original indication for ordering the study but of poten¬ 
tial medical value in patient care. Some geneticists felt that all 
variations should be reported to the patient (or family) while 
others argued that only “medically actionable” findings 
should be shared. A Working Group of the ACMG was con¬ 
vened in 2012, and after a year-long process of consensus 
provided a set of recommendations now widely adopted in 
the field (see Suggested Readings, Green et al). The panel 
recommended that laboratories performing these new tests 
analyze and report variants of genes for a specified group of 
56 conditions; they encouraged the development of an 
ongoing process for updating the guidelines. 

Intense controversy has surrounded the issues of cloning 
and embryonic stem cell research (see Chapter 13). It is 
important to repeat the distinction between reproductive 
cloning and therapeutic cloning. Because of the high failure 
rate of reproductive cloning in other mammals, and because 
the benefits of reproductive cloning are unclear, the scientific 
community is nearly unanimous in its opposition to the cre¬ 
ation of human beings through reproductive cloning. The use 
of cloning to create embryonic stem cells for therapeutic 
purposes (e.g., pancreatic islet cells for patients with diabetes 
or neurons for those with Alzheimer disease) is more contro¬ 
versial and, like pregnancy termination, involves highly 
charged questions about the definition of human life and the 
limits of medical intervention. As noted in Chapter 13, these 
issues require informed and thoughtful input from the scien¬ 
tific community and from patient advocacy groups, ethicists, 
philosophers, legal scholars, the clergy, and others. 

The science of genetics is no stranger to controversy and 
even to abuse. The eugenics movement (Greek, “good birth”), 
popular in the United States and some European countries in 
the early part of the 20th century, advocated both “positive 
eugenics” (preferential reproduction of those deemed to be 
genetically more fit) and “negative eugenics” (the prevention 
of reproduction of those thought to be genetically less fit). 
Eugenics, in concert with the political thinking of the times, 
led to a series of abuses that culminated in the atrocities of 
Nazi Germany. These events are a sobering reminder of the 
potential for misuse of genetic information. Geneticists must 
help to ensure that their science is used for maximum benefit 
while adhering to the time-honored maxim primum non 
nocere (“first, do no harm”). 
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I STUDY QUESTIONS 


1. Allen, a 40-year-old man, comes to your office because 
he is concerned about his family history of heart disease. 
His father had a fatal myocardial infarction (Ml) at 45 
years of age, and his paternal grandfather had a fatal MI 
at age 47 years. Allen’s father had two brothers and two 
sisters. One of the brothers had an MI at age 44 years, 
and one of the sisters had an MI at age 49 years. Allen’s 
mother had a brother and sister, both of whom are still 
alive. Allen’s mother’s parents both survived into their 
80s and died of “natural causes.” Draw a pedigree sum¬ 
marizing the information you have taken about Allen’s 
family, and make a recommendation for further study 
and/or treatment. 

2. Mary’s two brothers and her mother’s brother all had 
Duchenne muscular dystrophy (DMD) and are now dead. 
Based on only this information, what is the probability 
that Mary is a heterozygous carrier for this disorder? What 


is the probability that she will produce affected offspring? 
Suppose Mary has a serum creatine kinase (CK) test and 
is told that her level is above the 95th percentile for homo¬ 
zygous normal individuals. Approximately two thirds of 
DMD carriers have CK levels above the 95th percentile. 
Given this information, use Bayes theorem to calculate the 
probability that Mary is a carrier and the probability that 
she will produce affected offspring. 

3. Bob’s father had Huntington disease and is now deceased. 
Bob is now 51 years old and has no symptoms of Hun¬ 
tington disease. Age-of-onset curves show that approxi¬ 
mately 85% of individuals with an affected father show 
symptoms by this age (the percentage is slightly lower, 
about 80%, if the mother is affected). Based on this infor¬ 
mation, use Bayes theorem to estimate the probability that 
Bob inherited the Huntington disease mutation from his 
father. 
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5' cap A chemically modified guanine nucleo¬ 
tide added to the 5' end of a growing mRNA 
molecule. 

acceptor site AG sequence that defines the 
splice site at the 3' end of an intron. 
acetylation The addition of an acetyl group to 
a molecule (as in histone acetylation), 
acrocentric A chromosome whose centromere 
is close to the end of one arm. 
activator A specific transcription factor that 
binds to coactivators and to enhancers to 
help regulate the transcriptional activity of 
certain genes. 

adaptive immune system The portion of the 
immune system that is capable of changing its 
DNA sequence to bind foreign particles more 
effectively. It includes the humoral and cellular 
components. Compare with innate immune 
system. 

addition rule A law of probability that states 
that the probability of one event or another 
event occurring is derived by adding the prob¬ 
abilities of the two events together, assuming 
the events occur independently of one another, 
adenine One of the four DNA bases (abbrevia¬ 
tion: A). 

adeno-associated virus A type of parvovirus 
sometimes used as a vector for somatic cell 
gene therapy. 

adenovirus A double-stranded RNA virus 
that is sometimes used in gene therapy, 
adjacent segregation A meiotic segregation 
pattern in which the pairing of translocated 
chromosomes leads to unbalanced gametes. 
Compare with alternate segregation, 
affected sib-pair method A linkage analysis 
method in which sib pairs who are both 
affected with a disease are assessed for the 
extent to which they share alleles at various 
marker loci. If alleles are shared significantly 
more often than the expected 50%, linkage of 
the disease to the marker is indicated, 
affinity The binding power of an antibody with 
an antigen (low affinity indicates poor binding; 
high affinity indicates precise binding), 
affinity maturation A phase of B-cell devel¬ 
opment in which the cell undergoes somatic 
hypermutation so that some cells are formed 
that can achieve high-affinity binding to pep¬ 
tides from a pathogen. 

allele Conventional abbreviation for “allelo¬ 
morph.” Refers to the different forms, or 
DNA sequences, that a gene can have in a 
population. 

allele-specific oligonucleotide A short DNA 
sequence, usually 18 to 20 nucleotides, that 
can hybridize with either disease-causing or 
normal DNA sequences. Used in direct diag¬ 
nosis of mutations. 


alpha-fetoprotein Albumin-like protein pro¬ 
duced by the fetus. The level of alpha- 
fetoprotein is elevated in pregnancies with 
neural tube defects and may be decreased in 
pregnancies with Down syndrome, 
alpha-satellite DNA A type of repetitive DNA 
sequence found near centromeres, 
alternate segregation A meiotic segregation 
pattern in which the pairing of translocated 
chromosomes leads to balanced gametes. 
Compare with adjacent segregation, 
alternative splice site A variation in the loca¬ 
tion of intron-exon splice sites in some genes 
that allow one gene to produce multiple dif¬ 
ferent protein products. 

Alu family A major group of dispersed repeti¬ 
tive DNA sequences. 

amino acids The major building blocks of 
polypeptides. Each of the 20 amino acids is 
encoded by one or more mRNA codons, 
amniocentesis A prenatal diagnostic technique 
in which a small amount of amniotic fluid is 
withdrawn transabdominally at about 16 weeks 
after the last menstrual period. Fetal cells can 
then be tested for some genetic diseases, 
amniocentesis, early Amniocentesis carried 
out at approximately 12 to 14 weeks after the 
last menstrual period. 

amniocyte Fetal cell found in the amniotic 
fluid. 

anaphase One of the stages of cell division, in 
which sister chromatids separate and move 
toward opposite sides of the cell, 
anemia A deflciency of red blood cells or 
hemoglobin. 

aneuploid The condition in which the number 
of chromosomes is not a multiple of 23, as in 
trisomy and monosomy. Compare with 
euploid (noun: aneuploidy). 
angiogenesis Formation of new blood vessels, 
antibody Molecule produced by plasma cells; 

antibodies bind to invading antigens, 
anticipation A feature of pedigrees in which a 
disease is seen at earlier ages or with increased 
severity in more recent generations, 
anticodon A three-nucleotide DNA sequence 
in a tRNA molecule that undergoes comple¬ 
mentary base pairing with an mRNA codon, 
antigen A molecule that provokes antibody 
formation (from “antibody generator”), 
antigen-presenting cell (APC) A cell that 
engulfs foreign bodies, digests them, and then 
displays the foreign antigens on its cell surface 
for recognition by T lymphocytes, 
antisense strand In a double-stranded DNA 
molecule, the strand from which mRNA is 
transcribed. See sense strand, 
antisense therapy A type of somatic cell 
gene therapy in which an oligonucleotide is 


synthesized that can hybridize with a mutant 
mRNA sequence, blocking its translation into 
protein. 

apoptosis Programmed cell death. 

array CGH (aCGH) See comparative genomic 
hybridization. 

association The co-occurrence of two traits or 
events more often than expected by chance, 
autoimmunity The condition in which one’s 
immune system attacks one’s own cells, 
autoradiogram The image produced by 
exposing a radioactively labeled substance, 
such as a probe, to x-ray film (used, for 
example, in detecting RFLPs and in perform¬ 
ing in situ hybridization), 
autosomes The 22 pairs of chromosomes 
excluding the sex chromosomes (X and Y). 
axis specification Definition, during embry¬ 
onic development, of the major axes of the 
embryo: ventral/dorsal and anterior/posterior. 
B lymphocyte (also, B cell) A component of 
the adaptive immune system that produces 
antibodies. 

bacterial artificial chromosome (BAG) A 

recombinant plasmid inserted in bacteria that 
serves as a cloning vector capable of accepting 
DNA inserts of 50 to 200 kb. 
bacteriophage A virus that infects bacteria. In 
recombinant DNA technology, bacteriophages 
are used as vectors to carry inserted DNA 
sequences. 

bacteriophage P1 artificial chromosome 
(PAG) A cloning vector that consists of bac¬ 
teriophage PI and is inserted into a plasmid; 
it accepts DNA inserts up to 100 kb. 
bands (1) Visibly darkened areas on autora¬ 
diograms that represent the location of alleles 
on a gel. (2) Alternating dark and light areas 
visible on chromosomes after certain types of 
stains are used. 

Barr body The inactive X chromosome, visible 
as a densely staining chromatin mass in the 
somatic cells of normal females. Also known 
as sex chromatin. 

base One of the four nitrogenous substances 
(adenine, cytosine, guanine, or thymine) 
that make up part of the DNA molecule. Com¬ 
binations of bases specify amino acid 
sequences. 

base analog A substance that can mimic the 
chemical behavior of one of the four DNA 
bases. Base analogs are a type of mutagen, 
base pair (bp) A unit of complementary DNA 
bases in a double-stranded DNA molecule 
(A-T or C-G). 

base pair substitution The replacement of 
one base pair by another. A type of mutation. 
Bayes theorem A statistical procedure in 
which prior and conditional probabilities are 
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used to derive an improved estimate of prob¬ 
ability or risk. 

benign Describes a neoplasm (tumor) that 
does not invade surrounding tissue or metas¬ 
tasize to other parts of the body. Compare 
with malignant. 

bivalent A pair of intertwined homologous 
chromosomes seen in prophase I of meiosis. 
Synonymous with tetrad, 
blood group Molecules found on the surfaces 
of erythrocytes, some of which (ABO and Rh) 
determine blood transfusion compatibility, 
body plan The pattern and arrangement of body 
segments during embryonic development, 
bp Abbreviation for base pair, 
breakpoint The location on a chromosome at 
which a translocation has occurred, 
burden test A statistical test of disease risk in 
which the individual effects of all disease- 
associated variants at a locus are summed 
together. 

candidate gene A gene that, on the basis of 
known properties or protein product, is 
thought to be the gene causing a specific 
genetic disease. 

carcinogen A substance that can produce 
cancer (adjective: carcinogenic), 
carcinogenesis The process of cancer 
development. 

carcinomas Tumors that originate in epithe¬ 
lial cells. 

carrier A person who has a copy of a disease- 
causing gene but does not express the disease. 
The term is usually used to denote heterozy¬ 
gotes for a recessive disease gene, 
catalyst A substance that increases the rate of 
a chemical reaction. Enzymes are an example 
of a catalyst. 

C-banding A type of chromosome staining 
that highlights the constitutive heterochroma¬ 
tin that lies at and near centromeres. 
cDNA Complementary DNA, formed by 
reverse transcription of mRNA purified from 
a collection of cells. This type of DNA corre¬ 
sponds only to coding sequence (exons). 
cDNA library A collection of segments of 
complementary DNA (cDNA) cloned into 
vehicles such as phages or plasmids. Compare 
with genomic library. 

cell cycle The alternating sequence of mitosis 
and interphase. 

cell fate The location and function of cells, pro¬ 
grammed during embryonic development, 
cell-free DNA (cfDNA) DNA molecules that 
are not located within cells, 
cellular immune system The T-cell compo¬ 
nent of the adaptive immune system, 
centimorgan (cM) A unit of measure of the 
frequency of recombination between two 
loci, also known as a map unit. One cM 
corresponds to a recombination frequency 
of 1%. 

centriole Structure in cells that helps to pull 
chromosomes apart during meiosis and 
mitosis. 

centromere The region of a chromosome that 
separates the two arms; centromeres are the 
sites of attachment of spindle fibers during cell 
division. 


CG islands Unmethylated CG sequences that 
are found near the 5' ends of many genes, 
chiasma (plural: chiasmata) The location of 
a crossover between two homologous chro¬ 
mosomes during meiosis. 
chorionic villus sampling (CVS) A prenatal 
diagnostic technique in which a small sample 
of chorionic villi is aspirated. Usually per¬ 
formed at 10 to 12 weeks’ gestation, 
chromatin The combination of proteins (e.g., 
histones) and nucleic acids that makes up 
chromosomes. 

chromatin loop A unit of DNA coiling con¬ 
sisting of a group of solenoids. Each loop is 
approximately 100 kb. 

chromosome Threadlike structure (literally 
“colored body”) consisting of chromatin. 
Genes are arranged along chromosomes, 
chromosome abnormalities A major group 
of genetic diseases, consisting of microscopi¬ 
cally observable alterations of chromosome 
number or structure. 

chromosome banding The process of apply¬ 
ing specific stains to chromosomes in order to 
produce characteristic patterns of bands 
(example: G-banding). 

chromosome breakage The fracture of chro¬ 
mosomes; breakage is increased in the pres¬ 
ence of clastogens. 

chromosome disorders Disorders related to 
chromosomes. 

chromosome instability syndrome Dis¬ 
eases characterized by the presence of large 
numbers of chromosome breaks or exchanges, 
such as sister chromatid exchange (example: 
Bloom syndrome). 

chromosome-specific library A collection of 
DNA fragments from a single chromosome, 
class See isotype. 

class switching The process in which the heavy 
chains of B lymphocytes change from one class, 
or isotype, to another (e.g., IgM to IgG). 
clastogen A substance that can induce chro¬ 
mosome breakage (example: radiation), 
clinical genetics Direct clinical applications 
of genetics affecting the care of patients, 
clone (1) A series of identical DNA fragments 
created by recombinant DNA techniques. (2) 
Identical cells that are descended from a single 
common ancestor. 

cloning, functional A method of isolating 
genes in which a gene whose protein product’s 
function is already known is evaluated as a 
candidate gene responsible for a trait or 
disease. 

cloning, positional The isolation and cloning 
of a disease gene after determining its approxi¬ 
mate physical location; the gene product is 
subsequently determined. Formerly termed 
“reverse genetics.” 

clustered regularly interspersed short palin¬ 
dromic repeat (CRISPR) A gene-editing 
system that uses the Cas9 nuclease enzyme 
and a noncoding guide RNA molecule to 
produce a double-strand break at a specific 
genomic location. Once the break is made, 
DNA base pairs can be deleted or added to 
design animal models of disease or to carry 
out gene therapy. 


coactivator A type of specific transcription 
factor that binds to activators and to the 
general transcription factor complex to regu¬ 
late the transcription of specific genes, 
codominant Alleles that are both expressed 
when they occur together in the heterozygous 
state (example: A and B alleles of the ABO 
blood group system). 

codon A group of three mRNA bases, each 
of which specifies an amino acid when 
translated. 

coefficient of relationship A statistic that 
measures the proportion of genes shared by 
two individuals as a result of descent from a 
common ancestor. 

cofactors Substances that interact with 
enzymes to produce chemical reactions, such 
as various metabolic processes (examples: 
dietary trace elements, vitamins), 
colcemid or colchicine A spindle poison that 
arrests cells in metaphase, making them easily 
discernible microscopically, 
comparative genomic hybridization (CGH) 
Technique in which a mixture of DNA from a 
test source (e.g., a tumor) and a normal 
control are differentially labeled, mixed, and 
hybridized with normal metaphase chromo¬ 
somes or a microarray (array CGH). Differ¬ 
ences in color reveal chromosome losses or 
duplications in the test DNA. 
complement system A component of the 
immune system, encoded by genes in the class 
III MHC region, that can destroy invading 
organisms. The complement system also inter¬ 
acts with other components of the immune 
system, such as antibodies and phagocytes, 
complementary base pairing A fundamen¬ 
tal process in which adenosine pairs only 
with thymine and guanine pairs only with 
cytosine. Also sometimes known as Watson- 
Crick pairing. 

compound heterozygote An individual who 
is heterozygous for two different disease-caus¬ 
ing mutations at a locus. Compare with 
homozygote. Compound heterozygotes for 
recessive disease mutations are usually affected 
with the disorder. 

concordant Refers to two individuals who 
have the same trait (e.g., monozygotic twins 
may be concordant for a disease such as dia¬ 
betes). Compare with discordant (noun: 
concordance). 

conditional knockout The experimental 
inactivation of a gene (e.g., in a mouse 
model) that is confined to specific tissues, 
cells, or a time period during the organism’s 
development. 

conditional probability The probability that 
an event will occur, given that another event 
has already occurred. Conditional probabili¬ 
ties are used, for example, in Bayes theorem, 
confined placental mosaicism A form of 
mosaicism that is observed in the placenta but 
not in the fetus, 
congenital Present at birth, 
consanguinity The mating of related indi¬ 
viduals (adjective: consanguineous), 
consensus sequence A sequence that denotes 
the DNA bases most often seen in a region of 
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interest. The sequences that are found near 
donor and acceptor sites are a type of consen¬ 
sus sequence. 

conservation The preservation of highly 
similar DNA sequences among different 
organisms; conserved sequences are usually 
found in functional genes. 

conserved See conservation, 
constant region A component of the immu¬ 
noglobulin molecule that determines the 
major class of the molecule, 
constitutional or constitutive Pertaining to 
DNA in normal cells of the body, usually used 
in contrast to tumor DNA. 
constitutive heterochromatin See hetero¬ 
chromatin, constitutive, 
contiguous gene syndrome A disease caused 
by the deletion or duplication of multiple 
consecutive genes. See also microdeletion 
syndrome. 

copy number variant (CNV) DNA sequences 
of 1000 or more base pairs that are present in 
variable numbers in different individuals, 
cordocentesis See percutaneous umbilical 
blood sampling (PUBS), 
cosmid A phage-plasmid hybrid capable of 
accepting larger DNA inserts (up to 40-50 kb) 
than either phage or plasmids, 
costimulatory molecule Cell-surface mole¬ 
cule that participates in the binding of T-cell 
receptors to MHC-antigen complexes, 
cross Mating between organisms in genetic 
studies. 

crossing over or crossover The exchange of 
genetic material between homologous chro¬ 
mosomes during meiosis (also occurs rarely 
during mitosis); produces recombination, 
cross-reaction The binding of an antibody to 
an antigen other than the one that originally 
stimulated antibody formation. Such antigens 
are usually very similar to the original anti- 
body-generating antigen, 
cryptic splice site Site at which intron-exon 
splicing can occur when the usual splice site is 
altered. 

cyclin-dependent kinases (CDKs) Enzymes 
that form complexes with specific cyclins to 
phosphorylate regulatory proteins (such as 
pRb) at specific stages of the cell cycle, 
cyclin-dependent kinase (CDK) inhibitors 
Proteins that inactivate cyclin-dependent 
kinases. Many of these are tumor suppressors, 
cyclins Proteins that interact with specific 
cyclin-dependent kinases to regulate the cell 
cycle at specific stages. 

cytogenetics The study of chromosomes and 
their abnormalities. Combines cytology, the 
study of cells, and genetics, 
cytokine A growth factor that causes cells to 
proliferate (example: interleukins), 
cytokinesis Cytoplasmic division that occurs 
during mitosis and meiosis. 
cytosine One of the four DNA bases (abbre¬ 
viation: C). 

cytotoxic T lymphocyte A type of T lympho¬ 
cyte that destroys a cell when the cell presents 
a complex of MHC class I molecule and 
foreign peptide. Part of the cellular immune 
system. Also called killer T lymphocytes. 


daughter cells Cells that result from the divi¬ 
sion of a parent cell. 

deformation Alteration of the form, shape, or 
position of a normally formed body part by 
mechanical forces (example: oligohydramnios 
sequence). 

degenerate Refers to the genetic code—that 
multiple codons can encode the same amino 
acid. 

deletion The loss of chromosome material. 
May be terminal or interstitial. Compare with 

duplication. 

deletion, interstitial Deletion that removes 
part of the interior of the chromosome, 
deletion, terminal Deletion that removes part 
of a chromosome, including one telomere, 
dendritic cells A type of antigen-presenting 
cell that processes material from pathogens, 
presenting the material on the cell surface to 
activate B and T cells, 
de novo mutation See new mutation, 
deoxyribonucleic acid See DNA. 
derivative chromosome A chromosome that 
has been altered as a result of a translocation 
[example: derivative 9, or der(9)]. 
dideoxy method A technique for sequencing 
DNA in which dideoxynucleotides, which ter¬ 
minate replication, are incorporated into rep¬ 
licating DNA strands. 

dimer A molecule that consists of two struc¬ 
turally similar subunits. 

dinucleotide Two sequential nucleotides in a 
DNA sequence. 

diploid Having two copies of each chromo¬ 
some. In humans, the diploid number is 46. 
Compare with haploid, polyploid, 
direct diagnosis A form of DNA-based disease 
diagnosis in which the mutation itself is exam¬ 
ined directly. Compare with indirect diagnosis, 
discordant Refers to two individuals who do 
not share the same trait. Compare with 
concordant. 

disease-targeted sequencing DNA sequenc¬ 
ing protocol in which sequencing is limited to 
the genes already known to cause a specific 
disease. 

dispermy Fertilization of a single ovum by 
two sperm cells. 

dispersed repetitive DNA A class of repeated 
DNA sequences in which single repeats are 
scattered throughout the genome. Compare 
with tandem repeat. 

disruption Morphological defect that results 
from the breakdown of an otherwise normal 
developmental process (example: limb 
reduction defect resulting from poor 
vascularization). 

dizygotic A type of twinning in which each 
twin is produced by the fertilization of a dif¬ 
ferent ovum. Synonymous with fraternal twin. 
Compare with monozygotic. 

DNA (deoxyribonucleic acid) A double-helix 
molecule that consists of a sugar-phosphate 
backbone and four nitrogenous bases (A, C, G, 
and T). DNA bases encode messenger RNA 
(mRNA), which in turn encodes amino acid 
sequences. 

DNA-binding motifs Portions of transcrip¬ 
tion factors that allow them to interact with 


specific DNA sequences (examples: helix- 
loop-helix motif, zinc finger motif). 

DNA chips See microarrays. 

DNA microarrays See microarrays. 

DNA mismatch repair A type of DNA repair 
in which nucleotide mismatches (i.e., violations 
of the G-C, A-T complementary base-pairing 
rule) are corrected by specialized enzymes. 
DNA polymerase An enzyme involved in 
DNA replication and repair. 

DNA profile A series of DNA polymorphisms 
(usually VNTRs or microsatellites) typed in an 
individual. Because these polymorphisms are 
highly variable, the combined genotypes are 
useful in identifying individuals for forensic 
purposes. 

DNA repair A process in which mistakes in the 
DNA sequence are altered to represent the 
original sequence. 

DNA sequence The order of DNA bases along 
a chromosome. 

dominant An allele that is expressed in the 
same way in single copy (heterozygotes) as in 
double copy (homozygotes). Compare with 
recessive. 

dominant negative A type of mutation in 
which the altered protein product in a hetero¬ 
zygote forms a complex with the normal 
protein product produced by the homologous 
normal gene, thus disabling it. 
donor site The GT sequence that defines the 
splice site at the 5' end of an intron. 
dosage compensation The situation in 
which, as a consequence of X inactivation, the 
amount of X chromosome-encoded gene 
product in females is roughly equal to that in 
males. 

dosage mapping A technique for mapping 
genes in which excess or deficient gene product 
is correlated with a chromosomal duplication 
or deletion. 

dosage sensitivity A condition in which 
alteration of the level of a gene product (e.g., 
a deletion resulting in 50% expression or a 
duplication resulting in 150% expression of 
the gene product) causes a substantial altera¬ 
tion of the phenotype (including disease), 
double crossover The occurrence of two 
crossover events relatively close to one another, 
such that loci on either side of the double cross¬ 
over event do not undergo recombination, 
double helix The “twisted-ladder” shape of 
the double-stranded DNA molecule, 
double-stranded DNA breaks A type of 
DNA breakage in which both strands are 
broken at a specific location, 
downstream Refers to the orientation of 
sequences along a gene—specifically the 3' 
direction of the coding strand of DNA or the 
RNA molecule. Compare with upstream, 
driver genes Genes that can confer a growth 
advantage to tumors, making them primary 
causes of cancer. Compare with passenger 
genes. 

duplication The presence of an extra copy 
of chromosome material. Compare with 

deletion. 

dysmorphology The study of abnormal phys¬ 
ical development. 
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dysplasia A defect in which cells are abnor¬ 
mally organized into tissue (example: bone 
dysplasia). 

ectopic expression Expression of a gene 
product in an abnormal location or tissue type, 
electrophoresis A technique in which charged 
molecules are placed in a medium and exposed 
to an electrical field, causing them to migrate 
through the medium at different rates, accord¬ 
ing to charge, length, or other attributes. See 
also protein electrophoresis, 
embryonic stem cell Cells, found in early 
embryos, that have the potential to become 
any cell type (pluripotency). 
empirical risk A risk estimate based on direct 
observation of data. 

endocytosis A process in which molecules are 
transported into the interior of cells, 
enhancer A regulatory DNA sequence that 
interacts with specific transcription factors 
to increase the transcription of genes. 
Compare with silencer. 

equational division The second major cycle 
of meiosis: meiosis II. Compare with reduc¬ 
tion division stage. 

equatorial plane The middle of the cell’s 
spindle, along which homologous chromo¬ 
somes are arranged during metaphase, 
euchromatin Chromatin that is light-staining 
during interphase and tends to be transcription¬ 
ally active. Compare with heterochromatin, 
eugenics The use of controlled breeding to 
increase the prevalence of “desirable” genetic 
traits (positive eugenics) and to decrease the 
prevalence of “undesirable” traits (negative 
eugenics). 

eukaryotes Organisms whose cells have true 
nuclei. 

euploid Refers to cells whose chromosome 
number is a multiple of 23 (in humans) 
(noun: euploidy). 

exome The portion of the genome that is used 
to encode proteins. 

exome sequencing DNA sequencing in 
which only the protein-coding exons (exome) 
are sequenced. 

exons Portions of genes that encode amino 
acids and are retained after the primary mRNA 
transcript is spliced. Compare with intron. 
exon skipping A form of gene therapy in 
which an exon containing a harmful variant, 
such as a frameshift mutation, is removed, 
expanded newborn screening See newborn 
screening. 

expanded repeat A type of mutation in 
which a tandem trinucleotide repeat increases 
in number (example: Huntington disease), 
expressed sequence tag (EST) Several 
hundred bp of known cDNA sequence, 
flanked by PCR primers. Because they are 
derived from cDNA libraries, these sequences 
represent portions of expressed genes, 
false negative A test result in which an 
affected individual is incorrectly identified as 
being unaffected with the disease in question. 
Compare with false positive, 
false positive A test result in which an unaf¬ 
fected individual is incorrectly identified as 


being affected with the disease in question. 
Compare with false negative, 
family See gene family, 
flow cytometry A technique whereby chro¬ 
mosomes can be individually sorted, 
fluorescence in situ hybridization (FISH) A 
molecular cytogenetic technique in which 
labeled probes are hybridized with chromo¬ 
somes and then visualized under a fluores¬ 
cence microscope. 

founder effect A large shift in gene frequen¬ 
cies that results when a small “founder” popu¬ 
lation, which contains limited genetic 
variation, is derived from a larger population. 
Founder effect can be considered a special case 
of genetic drift. 

frameshift mutation An alteration of DNA in 
which a duplication or deletion occurs that is 
not a multiple of three base pairs, 
fusion gene A gene that results from a combi¬ 
nation of two genes or parts of two genes. 

G1 The gap 1 phase of interphase. 

G2 The gap 2 phase of interphase, 
gain of function A class of mutations that 
results in a protein product that either is 
increased in quantity or has a novel function. 
Compare with loss of function, 
gamete The haploid germ cell (sperm and 
ovum). 

gametogenesis The process of gamete 
formation. 

gastrulation The embryonic stage in which 
cells of the blastula are arranged to become a 
three-layer structure consisting of endoderm, 
mesoderm, and ectoderm, 
gene The fundamental unit of heredity, 
gene-environment interaction A mutual 
phenotypic effect of a gene and an environ¬ 
mental factor that is greater than the single 
effect of either factor alone (example: the 
effect of tti-antitrypsin deficiency and ciga¬ 
rette smoking on pulmonary emphysema), 
gene family A group of genes that are similar 
in DNA sequence and have evolved from a 
single common ancestral gene; they may or 
may not be located in the same chromosome 
region. 

gene flow The exchange of genes between dif¬ 
ferent populations. 

gene frequency In a population, the fraction 
of chromosomes that contain a specific gene, 
gene therapy The insertion or alteration of 
genes to correct a disease, 
gene therapy, germline Gene therapy that 
alters all cells of the body, including the germ¬ 
line. Compare with gene therapy, somatic 
cell. 

gene therapy, somatic cell Gene therapy 
that alters somatic cells but not cells of the 
germline. Compare with gene therapy, 
germline. 

genetic ancestry The origins of an individu¬ 
al’s ancestors; most commonly used to refer to 
the geographic origins of one’s ancestors, 
genetic code The combinations of mRNA 
codons that specify individual amino acids, 
genetic counseling The delivery of informa¬ 
tion about genetic diseases (risks, natural 


history, and management) to patients and 
their families. 

genetic drift An evolutionary process in 
which gene frequencies change as a result of 
random fluctuations in the transmission of 
genes from one generation to the next. Drift is 
greater in smaller populations, 
genetic engineering The alteration of genes; 
it typically involves recombinant DNA 
techniques. 

genetic lethal mutation An alteration of 
DNA sequence that prevents an individual 
from transmitting his or her DNA to the next 
generation. 

gene mapping The ordering of genes on 
chromosomes according to recombination 
frequency. Compare with physical mapping, 
genetic screening The large-scale testing of 
defined populations to identify those who are at 
increased risk of having a disease-causing gene, 
genetic testing The analysis of DNA, RNA, or 
proteins to test for the presence of differences 
that might cause a genetic disease, 
genome The totality of an organism’s DNA. 
genome scan A gene-mapping approach in 
which markers from the entire human genome 
are tested for linkage with a disease phenotype, 
genome-wide association studies (GWAS) 
A study design in which allele frequencies at 
large numbers of loci (typically single nucleo¬ 
tide polymorphisms, SNPs) are compared in 
disease cases and unaffected controls. SNPs 
that show large frequency differences between 
cases and controls are likely to be located in or 
near genes that are responsible for the disease, 
genomic instability An abnormal condition 
in which there is a substantial increase in 
mutations throughout the genome. It can 
occur, for example, when a DNA repair system 
is disabled. 

genomic library A collection of DNA frag¬ 
ments from an organism’s entire genome. It 
includes cDNA as well as noncoding DNA. 
Compare with cDNA library, 
genotype An individual’s allelic constitution 
at a locus. 

genotype frequency The fraction of indi¬ 
viduals in a population that carry a specific 
genotype. 

genotype-phenotype correlation The rela¬ 
tionship between different possible genotypes 
(i.e., different alleles) at a locus and the individ¬ 
ual’s phenotype. Because of allelic heterogene¬ 
ity, different alleles at a locus can produce more 
or less severe expression of a disease phenotype 
(e.g., missense vs. nonsense mutations), 
germline Cells responsible for the production 
of gametes. 

Giemsa banding (G-banding) A type of stain¬ 
ing that produces G-bands in chromosomes, 
gliomas Tumors that originate in glial cells, 
globin A major component of the hemoglobin 
molecule. Globin is also found in the verte¬ 
brate myoglobin molecule, 
growth factor A substance capable of stimu¬ 
lating cell proliferation. 

growth factor receptor A structure on cell 
surfaces to which growth factors can bind. 
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guanine One of the four DNA bases (abbre¬ 
viation: G). 

guanosine diphosphate (GDP) A partially 
dephosphorylated form of guanosine 
triphosphate. 

guanosine triphosphate (GTP) A molecule 
required for the synthesis of peptide bonds 
during translation. 

haploid Refers to cells that have one copy of 
each chromosome, the typical state for 
gametes. In humans, the haploid number is 23. 
haploinsufficiency Describes the situation in 
which 50% of the normal level of gene expres¬ 
sion (i.e., in a heterozygote) is not sufficient 
for normal function. 

haplotype The allelic constitution of multiple 
loci on a single chromosome. Derived from 
“haploid genotype.” 

Hardy-Weinberg principle Specifies an equi¬ 
librium relationship between gene frequencies 
and genotype frequencies in populations, 
heavy chain A major structural component of 
an antibody molecule, having a higher molec¬ 
ular weight than the other major component, 
the light chain. There are five major types of 
heavy chains in the human: y, p, a, 5, and E. 
helper T lymphocyte A type of T lymphocyte 
whose receptors bind to a complex of MHC 
class II molecule and foreign peptide on the 
surfaces of antigen-presenting cells. Part of the 
cellular immune system, 
heme The iron-containing component of the 
hemoglobin molecule; binds with oxygen, 
hemizygous Refers to a gene that is present in 
only a single copy [hemi = “half”). Most com¬ 
monly refers to genes on the single male X 
chromosome but can refer to other genes in 
the haploid state, such as the genes homolo¬ 
gous to a deleted region of a chromosome, 
heritability The portion of population vari¬ 
ance in a trait that can be ascribed to genetic 
factors. 

heterochromatin Dark-staining chromatin 
that is usually transcriptionally inactive and 
consists mostly of repetitive DNA. Compare 
with euchromatin. 

heterodisomy The presence in a cell of two 
chromosomes derived from a single parent 
and none from the other parent (disomy). In 
heterodisomy, the two chromosomes are the 
nonidentical homologous chromosomes. 
Compare with isodisomy, 
heterogeneity, allelic Describes conditions 
in which different alleles at a locus can produce 
variable expression of a disease. Depending on 
phenotype definition, allelic heterogeneity can 
cause two distinct diseases, as in Duchenne 
and Becker muscular dystrophy, 
heterogeneity, locus Describes diseases in 
which mutations at distinct loci can produce 
the same disease phenotype (examples: osteo¬ 
genesis imperfecta; retinitis pigmentosa), 
heteroplasmy The existence of differing DNA 
sequences at a locus within a single cell. Often 
seen in mitochondrial genes, 
heterotetramer A molecule consisting of four 
(tetra) subunits, at least one of which differs 
from the others. Compare with homotetramer. 


heterozygote An individual who has two dif¬ 
ferent alleles at a locus. Compare with 

homozygote. 

high-resolution banding Chromosome band¬ 
ing using prophase or prometaphase chromo¬ 
somes, which are more extended than metaphase 
chromosomes and thus yield more bands and 
greater resolution. 

high-throughput DNA sequencing Mas¬ 
sively parallel DNA analysis in which millions 
of short DNA sequences are produced 
simultaneously. 

histone The protein core around which DNA 
is wound in a chromosome. 

HLA-A, HLA-B, HLA-C See human leukocyte 
antigen. 

holandric Refers to Y-linked inheritance; a 
transmission exclusively from father to son. 
homeodomain A DNA-binding portion of 
transcription factor proteins that are involved 
in embryonic development, 
homologous (1) Refers to DNA or amino acid 
sequences that are highly similar to one 
another. (2) Describes chromosomes that pair 
during meiosis, one derived from the 
individual’s father and the other from the 
mother. 

homologs Chromosomes that are homologous, 
homotetramer Molecule consisting of four 
(tetra) identical subunits. Compare with 

heterotetramer. 

homozygote An individual in whom the two 
alleles at a locus are the same. Compare with 

heterozygote. 

housekeeping genes Genes whose protein 
products are required for cellular maintenance 
or metabolism. Because of their central role in 
the life of the cell, housekeeping genes are 
transcriptionally active in all cells, 
human artificial chromosome A synthetic 
chromosome consisting of an artificial centro¬ 
mere and telomeres and an insert of human 
DNA that can be 5 to 10 Mb. 
human leukocyte antigen (HLA) Older term 
for major histocompatibility complex 
(MHC). 

humoral immune system The B-cell compo¬ 
nent of the adaptive immune system, termed 
humoral because antibodies are secreted into 
the circulation. 

immunodeficiency disease A class of dis¬ 
eases characterized by insufficiencies in the 
immune response (example: severe combined 
immune deficiency). 

immunodeficiency disease, primary Disor¬ 
der of the immune system that is caused 
directly by defects (usually genetic) in compo¬ 
nents or cells of the immune system, 
immunodeficiency disease, secondary Dis¬ 
order of the immune system that is a conse¬ 
quence of an agent or defect that originates 
outside the immune system (e.g., infection, 
radiation, drugs). 

immunogenetics The study of the genetic 
basis of the immune system, 
immunoglobulin Receptor found on the sur¬ 
faces of B cells. When secreted into the circula¬ 
tion by B cells that have matured into plasma 


cells, immunoglobulins are known as 

antibodies. 

imprinting, genomic Describes process in 
which genetic material is expressed differently 
when inherited from the mother than when 
inherited from the father, 
in vitro fertilization (IVF) A procedure in 
which the fertilization of an egg cell by a 
sperm cell is carried out in the laboratory. The 
embryo is then implanted in the mother’s 
uterus. 

inbreeding coefficient The probability that 
an individual is homozygous at a locus as a 
result of consanguinity in his or her parents, 
incest The mating of closely related individu¬ 
als, usually describing the union of first-degree 
relatives. 

incidental findings Findings in a patient that 
are not related to the original reason for 
testing (e.g., as in a genetic test) but that have 
potential significance for patient care, 
independence A principle, often invoked in 
statistical analysis, indicating that the occur¬ 
rence of one event has no effect on the prob¬ 
ability of occurrence of another (adjective: 
independent). 

independent assortment One of Mendel’s 
fundamental principles; dictates that alleles at 
different loci are transmitted independently of 
one another. 

index case See proband, 
indirect diagnosis A form of genetic diagno¬ 
sis in which the disease-causing mutation is 
not directly observed; usually refers to diagno¬ 
sis using linked markers. Compare with direct 
diagnosis. 

induction The influence or determination of 
development of one group of cells by a second 
group of cells. 

innate immune system The portion of the 
immune system that does not change its char¬ 
acteristics to respond to infections. Major part 
of the initial immune response. Compare with 
adaptive immune system, 
insert A DNA sequence that is placed into a 
vector, such as a plasmid or cosmid, using 
recombinant DNA techniques, 
insertion The addition of one or more nucleo¬ 
tide bases into a DNA sequence. Compare 
with deletion. 

interphase The portion of the cell cycle that 
alternates with meiosis or mitosis (cell divi¬ 
sion). DNA is replicated and repaired during 
this phase. 

intraclass correlation coefficient A statisti¬ 
cal measure that varies between —1 and 1 and 
specifies the degree of similarity of two quan¬ 
tities in a sample or population, 
intron DNA sequence found between two 
exons. It is transcribed into primary mRNA 
but is spliced out in the formation of the 
mature mRNA transcript, 
inversion A structural rearrangement of a 
chromosome in which two breaks occur, fol¬ 
lowed by reinsertion of the chromosome 
segment, but in reversed order. It may be either 
paracentric or pericentric. See also inversion, 
paracentric and inversion, pericentric. 
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inversion, paracentric An inversion that does 
not include the centromere. 

inversion, pericentric An inversion that 
includes the centromere, 
isochromosome A structural chromosome 
rearrangement caused by the division of a 
chromosome along an axis perpendicular to 
the usual axis of division; results in chromo¬ 
somes with either two short arms or two long 
arms. 

isodisomy The presence in a cell of two identi¬ 
cal chromosomes derived from one parent and 
none from the other parent. Compare with 

heterodisomy. 

isotype Classes of immunoglobulin molecules 
(e.g., IgA, IgE, IgG), determined by the type of 
heavy chain present in the molecule, 
joint probability The probability of two 
events both occurring, 
karyogram See karyotype, 
karyotype A display of chromosomes, ordered 
according to length. 

killer cell immunoglobulin-like receptors 
(KIR) Molecules found on the surfaces of 
natural killer cells (see natural killer cell); 
their function is to inhibit the action of killer 
cells when they bind to MHC class I molecules 
on the surfaces of normal cells, 
killer T lymphocyte See cytotoxic T 
lymphocyte. 

kilobase (kb) One thousand DNA base pairs, 
knockout An animal model in which a specific 
gene has been disabled. 

lentivirus A type of retrovirus that can enter 
nondividing cells. 

leukemias Tumors that originate in hemato¬ 
poietic cells. 

liability distribution A theoretical scale of 
susceptibility to disease, which varies among 
individuals depending on genetic and envi¬ 
ronmental risk factors. 

light chain A major structural component of 
the antibody molecule, consisting of either a K 
or a X chain. The light chain has a lower 
molecular weight than the other major com¬ 
ponent, the heavy chain, 
likelihood or likelihood ratio A statistic that 
measures the probability of an event or a series 
of events. 

LINES (long interspersed elements) A class 
of dispersed repetitive DNA in which each 
repeat is relatively long, up to 7 kb. Compare 
with SINEs. 

linkage or linked Describes two loci that are 
located close enough on the same chromo¬ 
some that their recombination frequency is 
less than 50%. 

linkage disequilibrium A nonrandom asso¬ 
ciation of alleles at linked loci in populations. 
Compare with linkage equilibrium, 
linkage equilibrium Lack of preferential asso¬ 
ciation of alleles at linked loci. Compare with 
linkage disequilibrium. 

linkage phase The arrangement of alleles of 
linked loci on chromosomes, 
liposome A fat body sometimes used as a 
vector for somatic cell gene therapy, 
locus The chromosome location of a specific 
gene (plural: loci). 


locus control region A DNA sequence in the 
5' region of the globin gene clusters that is 
involved in transcriptional regulation. 

LOD score The common logarithm of the 
ratio of the likelihood of linkage at a specific 
recombination fraction to the likelihood of 
no linkage. 

loss of function A class of mutation in which 
the alteration results in a nonfunctional protein 
product. Compare with gain of function, 
loss of heterozygosity Describes a locus or 
loci at which a deletion or other process has 
converted the locus from heterozygosity to 
homozygosity or hemizygosity. 
low-copy repeats Repeated DNA sequences 
in the genome whose number is relatively 
small. Contrast with Alu repeats or short 
tandem repeats. 

lymphomas Tumors that originate in cells of 
the lymphatic system. 

Lyon hypothesis A proposal (now verified) 
that one X chromosome is randomly inacti¬ 
vated in each somatic cell of the normal 
female embryo (lyonization). 
macrophage A type of phagocyte that ingests 
foreign microbes and displays them on its 
surface for recognition by T-cell receptors, 
major gene A single locus responsible for a 
trait (sometimes contrasted with a polygenic 
component). 

major histocompatibility complex (MHC), 
class I A membrane-spanning glycoprotein, 
found on the surfaces of nearly all cells, that 
presents antigen for recognition by cytotoxic 
T lymphocytes. Compare with MHC class II. 
major histocompatibility complex (MHC), 
class II A membrane-spanning glycopro¬ 
tein, found on the surfaces of antigen-pre¬ 
senting cells, that presents antigen for 
recognition by helper T cells, 
malformation A primary morphologic defect 
resulting from an intrinsically abnormal devel¬ 
opmental process (example: polydactyly). 
malignant Describes a tumor that is capable 
of invading surrounding tissue and metasta¬ 
sizing to other sites in the body. Compare with 
benign. 

manifesting heterozygote An individual 
who is heterozygous for a recessive trait but 
displays the trait. Most often used to describe 
females who are heterozygous for an X-linked 
trait and display the trait, 
markers Polymorphisms, such as RFLPs, 
VNTRs, microsatellite repeats, and blood 
groups, that are linked to a disease locus, 
mass spectrometry Analysis of the mass-to- 
charge ratio of molecules; can be used to 
sequence DNA and detect mutations, 
mass spectrometry, tandem A form of mass 
spectrometry in which two mass spectrometry 
machines are used, the first of which separates 
molecules according to mass, and the second 
of which assesses the mass and charge of the 
molecules after they have been fragmented, 
maternal serum alpha-fetoprotein (MSAFP) 
Alpha-fetoprotein present in the serum of 
pregnant women; used in prenatal screening 
for fetal disorders such as neural tube defects 
and Down syndrome. 


mature transcript Describes mRNA after the 
introns have been spliced out. Before splicing, 
the mRNA is referred to as a primary 
transcript. 

maximum likelihood estimate A statistical 
procedure in which the likelihoods of a variety 
of parameter values are estimated and then 
compared to determine which likelihood is the 
largest. Used, for example, in evaluating LOD 
scores to determine which recombination fre¬ 
quency is the most likely. 

Mb (megabase) One million base pairs, 
meiosis Cell division process in which haploid 
gametes are formed from diploid germ cells, 
meiotic failure Aberrant meiosis in which a 
diploid gamete is produced rather than the 
normal haploid gamete. 

memory cells A class of high-affinity-binding 
B or T cells that remain in the body after an 
immune response has ended; they provide a 
relatively rapid, high-affinity response should 
the same disease antigen be encountered a 
second time. 

mendelian Referring to Gregor Mendel; 
describes a trait that is attributable to a single 
gene. 

mesenchyme Tissue that forms connective 
tissues and lymphatic and blood vessels during 
embryonic development, 
messenger RNA (mRNA) An RNA molecule 
that is formed from the transcription of 
DNA. Before intron splicing, mRNA is termed 
a primary transcript; after splicing, the 
mature transcript (or mature mRNA) pro¬ 
ceeds to the cytoplasm, where it is translated 
into an amino acid sequence, 
metacentric A chromosome in which the cen¬ 
tromere is located approximately in the middle 
of the chromosome arm. 
metaphase A stage of mitosis and meiosis in 
which homologous chromosomes are 
arranged along the equatorial plane, or meta¬ 
phase plate, of the cell. This is the mitotic stage 
at which chromosomes are maximally con¬ 
densed and most easily visualized, 
metastasis The spread of malignant cells 
from one site in the body to another (verb: 
metastasize). 

methylation The attachment of methyl 
groups; in genetics, refers especially to the 
addition of methyl groups to cytosine bases, 
forming 5-methylcytosine. Methylation is cor¬ 
related with reduced transcription of genes. 
MHC restriction The limiting of immune 
response functions to MHC-mediated inter¬ 
actions (e.g., the binding of T-cell receptors is 
MHC restricted because it requires the presen¬ 
tation of antigen by MHC class I or MHC class 
II molecules). 

microarrays Arrangement of large numbers 
of DNA sequences, such as oligonucleotides 
consisting of normal and mutated sequences, 
on glass slides or silicone chips (DNA chips). 
These oligonucleotides can be hybridized to 
labeled DNA from subjects to test for sequence 
variants, to sequence DNA, or to analyze gene 
expression patterns. 

microdeletion syndrome A chromosome 
deletion that is too small to be visible under a 
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microscope (examples: DiGeorge syndrome; 
Prader-Willi syndrome). See also contiguous 
gene syndrome. 

microRNAs (miRNA) Small untranslated RNA 
sequences that bind to and regulate mRNA 
sequences. 

microsatellite A type of satellite DNA that 
consists of small repeat units (usually 2, 3, 4, 
or 5 bp) that occur in tandem, 
microsatellite repeat polymorphism See 
short tandem repeats (STRs). 
minisatellite A type of satellite DNA that 
consists of tandem repeat units that are each 
about 20 to 70 bp long. Variation in the 
number of minisatellite repeats is the basis of 
VNTR polymorphisms. 

mismatch The presence in one chain of dou¬ 
ble-stranded DNA of a base that is not com¬ 
plementary to the corresponding base in the 
other chain. Also known as mispairing. 
mismatch repair A DNA repair process in 
which mismatched nucleotides are altered so 
that they become complementary, 
missense mutation A type of mutation that 
results in a single amino acid change in the 
translated gene product. Compare with non¬ 
sense mutation. 

mitochondria Cytoplasmic organelles that are 
important in cellular respiration. The mito¬ 
chondria have their own unique DNA. 

mitochondrial disorders Disorders of 
mitochondria. 

mitosis The ceU division process in which two 
identical progeny cells are produced from a 
single parent cell. Compare with meiosis. 
mobile elements DNA sequences that are 
capable of inserting themselves (or copies of 
themselves) into other locations in the 
genome. 

modifier gene A gene that alters the expres¬ 
sion of a gene at another locus, 
molecular genetics Study of the structure 
and function of genes at the molecular 
level. 

monoclonal Refers to a group of cells that 
consist of a single clone (i.e., all cells are 
derived from the same single ancestral cell), 
monogenic Describing a single-gene, or men- 
delian, trait. 

monosomy An aneuploid condition in which 
a specific chromosome is present in only a 
single copy, giving the individual a total of 45 
chromosomes. 

monozygotic Describes a twin pair in which 
both members are derived from a single 
zygote. Synonymous with identical twin. 
Compare with dizygotic, 
morphogenesis The process of development 
of a cell, organ, or organism, 
mosaicism The existence of two or more genet¬ 
ically different ceU lines in an individual, 
mosaic, germline A type of mosaic in which 
the germline of an individual contains an 
allele not present in the somatic cells, 
multifactorial Describes traits or diseases that 
are the product of the interactions of multiple 
genetic and environmental factors (example: 
neural tube defects). 

multifactorial disorders See multifactorial. 


multi-hit concept of carcinogenesis The 

principle that most tumors arise from a series 
of errors, or “hits,” occurring in a cell, 
multiplication rule A law of probability that 
states that the probability of co-occurrence of 
two or more independent events can be 
obtained by multiplying the individual prob¬ 
abilities of each event. 

multipoint mapping A type of genetic 
mapping in which the recombination frequen¬ 
cies among three or more loci are estimated 
simultaneously. 

mutagen A substance that causes a mutation, 
mutation An alteration in DNA sequence, 
mutation hot spots Nucleotide sequences in 
which the mutation rate is substantially 
elevated. 

mutation. Induced A mutation that is caused 
by an exogenous factor, such as radiation. 
Compare with mutation, spontaneous, 
mutation, spontaneous A mutation that is 
not known to be the result of an exogenous 
factor. Compare with mutation, induced, 
mutation-selection balance A state in which 
the rate of elimination of an allele from a 
population (due to natural selection) is equal 
to the rate of introduction of the allele in a 
population (due to mutation). Mutation- 
selection balance can predict the gene fre¬ 
quency of an allele in a population, 
natural killer cell A type of lymphocyte that 
is involved in the early phase of defense against 
foreign microbes and tumors and is not MHC 
restricted. 

natural selection An evolutionary process in 
which individuals with favorable genotypes 
produce relatively greater numbers of surviv¬ 
ing offspring. 

negative predictive value In disease 
screening, the percentage of subjects with a 
negative test result who truly do not have the 
disease. Compare with positive predictive 
value. 

neoplasm or tumor A group of cells charac¬ 
terized by unregulated proliferation (may be 
benign or malignant). 

neurulation Formation of the neural tube 
during embryonic development, 
new mutation An alteration in DNA sequence 
that appears for the first time in a family as the 
result of a mutation in one of the parents’ 
germ cells. 

newborn screening Testing the population of 
newborn infants for a condition, such as 
PKU, that is detectable shortly after birth. 
Expanded newborn screening refers to the 
use of techniques such as tandem mass spec¬ 
trometry that allow a larger number of condi¬ 
tions to be screened in the newborn 
population. 

nondirectiveness Describes the genetic coun¬ 
seling approach in which information is pro¬ 
vided to a family, while decisions about 
reproduction are left: to the family, 
nondisjunction Failure of homologous chro¬ 
mosomes (in mitosis or meiosis I) or sister 
chromatids (in meiosis II) to separate properly 
into different progeny cells. Can produce 
aneuploidy. 


noninvasive prenatal screening (NIPS) Forms 
of prenatal screening or testing in which target 
tissue is not penetrated or disturbed, 
nonmendelian Any of the nonautosomal 
modes of inheritance that were not studied by 
Gregor Mendel. 

nonsense mutation A type of mutation in 
which an mRNA stop codon is produced, 
resulting in premature termination of transla¬ 
tion, or removed, resulting in an elongated 
protein product. Compare with missense 
mutation. 

Northern blotting A gene expression assay in 
which mRNA on a blot is hybridized with a 
labeled probe. 

nucleosome A structural unit of chromatin 
in which 140 to 150 bp of DNA are wrapped 
around a core unit of eight histone 
molecules. 

nucleotide A basic unit of DNA or RNA, con¬ 
sisting of one deoxyribose (or ribose in the 
case of RNA), one phosphate group, and one 
nitrogenous base. 

nucleotide excision repair A type of DNA 
repair in which altered groups of nucleotides, 
such as pyrimidine dimers, are removed 
and replaced with properly functioning 
nucleotides. 

obligate carrier An individual who is known 
to possess a disease-causing gene (usually on 
the basis of pedigree examination) but may or 
may not be affected with the disease 
phenotype. 

oligonucleotide DNA sequence consisting of 
a small number of nucleotide bases, 
oncogene A gene that can transform cells 
into a highly proliferative state, causing 
cancer. 

oogenesis The process by which ova are 
produced. 

oogonia The diploid germline stem cells from 
which ova are ultimately derived, 
organogenesis The formation of organs 
during embryonic development, 
ovum Mature female reproductive cell, 
palindrome A DNA sequence whose comple¬ 
mentary sequence is the same if read back¬ 
ward (e.g., 5'-AATGCGCATT-3'). 
panmixia Describes a population in which 
individuals mate at random with respect to a 
specific genotype. 

paralog Within a species, a member of a set of 
homologous genes (example: HOXA13 and 
HOXD13). 

partial trisomy A chromosome abnormality 
in which a portion of a chromosome is present 
in three copies; may be produced by reciprocal 
translocation or unequal crossover. See also 
translocation, reciprocal and crossover, 
passenger genes Genes that exhibit somatic 
mutations during tumorigenesis but do not 
give rise to a growth advantage. Compare with 
driver genes. 

pattern formation The spatial arrangement 
of differentiated cells to form tissues and 
organs during embryonic development, 
pedigree A diagram that describes family rela¬ 
tionships, sex, disease status, and other 
attributes. 
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penetrance The probability of expressing a 
phenotype, given that an individual has inher¬ 
ited a predisposing genotype. If this probabil¬ 
ity is less than 1.0, the disease genotype is said 
to have reduced or incomplete penetrance, 
penetrance, age-dependent Describes dis¬ 
ease phenotypes that have a higher probabil¬ 
ity of occurrence as the age of the individual 
with the at-risk genotype increases (examples: 
Huntington disease, familial breast cancer), 
percutaneous umbilical blood sampling 
(PUBS) Prenatal diagnostic technique in 
which fetal blood is obtained by puncture of 
the umbilical cord. Also called cordocentesis. 
personalized medicine An approach to 
medical care in which treatment is designed 
specifically for the individual patient. In 
genetics, the goal is to incorporate the genetic 
profile of the patient into decisions about 
diagnosis and treatment. See also precision or 
predictive medicine. 

phagocyte A cell that engulfs foreign 
particles. 

pharmacogenetics Study of genetic variation 
in drug response. 

pharmacogenomics Study of genetic varia¬ 
tion in drug response, using data from many 
genes assayed across the genome (compare 
with pharmacogenetics), 
phenocopy A phenotype that resembles the 
phenotype produced by a specific gene but is 
caused instead by a different, typically non- 
genetic, factor. 

phenotype The observed characteristics of an 
individual, produced by the interaction of 
genes and environment. 

Philadelphia chromosome A reciprocal 
translocation between the long arms of chro¬ 
mosomes 9 and 22 in somatic cells; produces 
chronic myelogenous leukemia, 
physical mapping The determination of 
physical distances between genes using cyto¬ 
genetic and molecular techniques. Compare 
with gene mapping, in which recombination 
frequencies are estimated, 
plasma cell A mature B lymphocyte capable 
of secreting antibodies. 

plasmid A circular double-stranded DNA mol¬ 
ecule found in bacteria; capable of independent 
replication. Plasmids are often used as cloning 
vectors in recombinant DNA techniques, 
pleiotropy Describes genes that have multiple 
phenotypic effects (examples: Marfan syn¬ 
drome, cystic fibrosis) (adjective: pleiotropic). 
pluripotency The ability of a cell to develop 
into more than one type of mature, differenti¬ 
ated cell type. 

point mutation (1) In molecular genetics, the 
alteration of a single nucleotide to a different 
nucleotide. (2) In classical genetics, an altera¬ 
tion of DNA sequence too small to be detected 
under a light microscope, 
polar body A ceU produced during oogenesis 
that has a nucleus but very little cytoplasm, 
polar body diagnosis A prenatal diagnostic 
technique in which DNA from a polar body is 
subjected to PCR amplification and assayed 
using molecular methods. 


polarity Direction (e.g., definition of anterior 
versus posterior in axis specification). 
poly-A tail The addition of several adenine 
nucleotides to the 3' end of a primary mRNA 
transcript. 

polygenic Describes a trait caused by the com¬ 
bined additive effects of multiple genes, 
polymerase chain reaction (PCR) A tech¬ 
nique for amplifying a large number of copies 
of a specific DNA sequence flanked by two 
oligonucleotide primers. The DNA is alter¬ 
nately heated and cooled in the presence of 
DNA polymerase and free nucleotides so that 
the specified DNA segment is denatured, 
hybridized with primers, and extended by 
DNA polymerase. 

polymorphism A locus in which two or more 
alleles have gene frequencies greater than 0.01 
in a population. When this criterion is not 
fulfilled, the locus is monomorphic. 
polypeptide A series of amino acids linked 
together by peptide bonds, 
polyploidy A chromosome abnormality in 
which the number of chromosomes in a cell is 
a multiple of 23 but is greater than the diploid 
number (examples are triploidy [69 chromo¬ 
somes in the human] and tetraploidy [92 
chromosomes in the human]), 
population genetics The branch of genetics 
dealing with genetic variation and genetic evo¬ 
lution of populations. 

population screening The large-scale testing 
of populations for a disease, 
positional candidate A gene-mapping 
approach in which linkage analysis is used to 
define a gene’s approximate location. Known 
candidate genes in the region are then evalu¬ 
ated for their possible role in causing the trait 
or disease being analyzed, 
positional cloning See cloning, positional, 
positive predictive value Among the 
individuals identified by a test as having a 
disease, the percentage that actually have the 
disease. Compare with negative predictive 
value. 

posterior probability In Bayesian analysis, 
the final probability of an event after taking 
into account prior, conditional, and joint 
probabilities. 

posttranslational modification Various types 
of additions and alterations of a polypeptide 
that take place after the mature mRNA tran¬ 
script is translated into a polypeptide (e.g., 
hydroxylation, glycosylation, cleavage of por¬ 
tions of the polypeptide), 
precision or predictive medicine Medical 
practice that is tailored to the specific needs of 
an individual patient. Synonymous with 
personalized medicine. 

preimplantation genetic diagnosis (PGD) 

A form of genetic testing in which one or 
two cells obtained from an early embryo 
(created by IVF) are checked for chromosome 
abnormalities (typically by FISH) or single¬ 
gene mutations (by PCR amplification of 
DNA). 

prenatal diagnosis The identification of a 
disease in a fetus or embryo. 


presymptomatic diagnosis The identifica¬ 
tion of a disease before the phenotype is clini¬ 
cally observable. 

primary oocyte The diploid product of an 
oogonium. All primary oocytes are produced 
in the female during prenatal development; 
they undergo meiosis I to produce secondary 
oocytes when ovulation begins, 
primary spermatocyte The diploid progeny 
cell of a spermatogonium, which then under¬ 
goes meiosis I to produce secondary 
spermatocytes. 

primary transcript The mRNA molecule 
directly after transcription from DNA. A 
mature mRNA transcript is formed from the 
primary transcript when the introns are 
spliced out. 

primer An oligonucleotide sequence that 
flanks either side of the DNA to be amplified 
by polymerase chain reaction, 
primer extension Part of the polymerase 
chain reaction process, in which DNA poly¬ 
merase extends the DNA sequence beginning 
at an oligonucleotide primer, 
primitive streak A structure formed during 
mammalian gastrulation, consisting of thick¬ 
ened epiblast tissue along the anterior/poste¬ 
rior axis. 

prior probability In Bayesian analysis, the 
probability that an event will occur, estimated 
before any additional information, such as a 
biochemical carrier test, is incorporated, 
probability The proportion of times that a 
specific event occurs in a series of trials, 
proband The first person in a pedigree to be 
identified clinically as having the disease in 
question. Synonymous with propositus and 
index case. 

probe In molecular genetics, a labeled sub¬ 
stance, such as a DNA segment, that is used to 
identify a gene, mRNA transcript, or gene 
product (usually through hybridization of the 
probe with the target). 

prokaryote Any organism that lacks a defined 
nucleus. 

promoter A DNA sequence located 5' of a 
gene to which RNA polymerase binds in 
order to begin transcription of the DNA into 
mRNA. 

promoter mutation Any alteration of DNA 
sequence that is found in the promoter region 
of a gene. 

proofreading The correction of errors that 
occur during replication, transcription, or 
translation. 

prophase The first stage of mitosis and meiosis. 
propositus See proband, 
protein electrophoresis A technique in 
which amino acid variations are identified on 
the basis of charge differences that cause dif¬ 
ferential mobility of polypeptides through an 
electrically charged medium, 
protein kinase An enzyme that phosphory- 
lates serine, threonine, or tyrosine residues in 
proteins. 

protein truncation test A mutation detection 
test in which the encoded protein product is 
artificially translated to reveal the presence of 
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truncation-causing mutations (e.g., nonsense 
or frameshift mutations), 
proto-oncogene A gene whose protein 
product is involved in the regulation of cell 
growth. When altered, a proto-oncogene can 
become a cancer-causing oncogene, 
pseudoautosomal region The distal tip of the 
Y chromosome short arm, which undergoes 
crossover with the distal tip of the X chromo¬ 
some short arm during meiosis in the male, 
pseudogene A gene that is highly similar in 
sequence to another gene or genes but has 
been rendered transcriptionally or transla- 
tionally inactive by mutations, 
pseudomosaicism A false indication of fetal 
mosaicism, caused by an artifact of cell culture. 
Punnett square A table specifying the geno¬ 
types that can arise from the gametes contrib¬ 
uted by a mating pair of individuals, 
purine The two DNA (also RNA) bases, adenine 
and guanine, that consist of double carbon- 
nitrogen rings. Compare with pyrimidine, 
pyrimidine The bases (cytosine and thymine 
in DNA; cytosine and uracil in RNA) that 
consist of single carbon-nitrogen rings. 
Compare with purine. 

quadruple screen A screening test for fetal 
Down syndrome, and several other condi¬ 
tions, that can be carried out on maternal 
serum during pregnancy. The quadruple 
screen tests maternal serum levels of 
unconjugated estriol, human chorionic 
gonadotropin, inhibin-A, and maternal serum 
alpha-fetoprotein. 

quantitative trait A characteristic that can be 
measured on a continuous scale (e.g., height, 
weight). 

quantitative trait loci Genes responsible for 
variation in quantitative traits, such as height, 
quasidominant A pattern of inheritance that 
appears to be autosomal dominant but is actu¬ 
ally autosomal recessive. Usually the result of 
a mating between an affected homozygote and 
a heterozygote. 

quinacrine banding (Q-banding) Chromo¬ 
some staining technique in which a fluoro- 
chrome dye (quinacrine compound) is added 
to chromosomes, which are then viewed under 
a fluorescence microscope, 
race A grouping of human populations that 
may be based on geographic origin, language, 
or other attributes. Most commonly, human 
races have corresponded roughly to continen¬ 
tal origins. 

radiation, ionizing A type of energy emission 
that is capable of removing electrons from 
atoms, thus causing ions to form (example: 
x-rays). 

radiation, nonionizing A type of energy 
emission that does not remove electrons from 
atoms but can change their orbits (example: 
ultraviolet radiation), 
random mating See panmixia, 
receptor A cell-surface structure that binds to 
extracellular particles. 

recessive An allele that is phenotypically 
expressed only in the homozygous or hemizy- 
gous state. The recessive allele is masked by a 


dominant allele when the two occur together 
in a heterozygote. Compare with dominant, 
recombinant DNA A DNA molecule that con¬ 
sists of components from more than one 
parent molecule (e.g., a human DNA insert 
placed in a plasmid vector), 
recombinase An enzyme that helps to bring 
about somatic recombination (especially 
important in B and T lymphocytes), 
recombination The occurrence among off¬ 
spring of new combinations of alleles, result¬ 
ing from crossovers that occur during parental 
meiosis. 

recombination frequency The percentage of 
meioses in which recombinants between two 
loci are observed. Used to estimate genetic dis¬ 
tances between loci. See also centimorgan. 
recombination hot spot A region of a chro¬ 
mosome in which the recombination fre¬ 
quency is elevated. 

recurrence risk The probability that another 
affected offspring will be produced in families 
in which one or more affected offspring have 
already been produced. 

reduction division stage The first stage of 
meiosis (meiosis I), in which the chromosome 
number is reduced from diploid to haploid, 
redundancy, genetic The existence of alter¬ 
nate genetic mechanisms or pathways that can 
compensate when a mechanism or pathway is 
disabled. 

regulatory T cell Types of T cells that func¬ 
tion to control other components of the 
immune response. 

repetitive DNA DNA sequences that are 
found in multiple copies in the genome. They 
may be dispersed or repeated in tandem, 
replication The process in which the double- 
stranded DNA molecule is duplicated, 
replication bubble A replication structure 
that occurs in multiple locations on a chromo¬ 
some, allowing replication to proceed more 
rapidly. 

replication origin The point at which replica¬ 
tion begins on a DNA strand; in eukaryotes, 
each chromosome has numerous replication 
origins. 

restriction digest Process in which DNA is 
exposed to a restriction enzyme, causing it to 
be cleaved into restriction fragments, 
restriction endonuclease A bacterial enzyme 
that cleaves DNA at a specific DNA sequence 
(restriction site). 

restriction enzymes Enzymes that cleave 
DNA at specific nucleotide sequences, 
restriction fragment A piece of DNA that has 
been cleaved by a restriction endonuclease, 
restriction fragment length polymorphism 
(RFLP) Variations in DNA sequence in popu¬ 
lations, detected by digesting DNA with a restric¬ 
tion endonuclease, electrophoresing the resulting 
restriction fragments, transferring the fragments 
to a solid medium (blot), and hybridizing the 
DNA on the blot with a labeled probe, 
restriction site A DNA sequence that is 
cleaved by a specific restriction endonuclease, 
retrovirus A type of RNA virus that can 
reverse-transcribe its RNA into DNA for 


insertion into the genome of a host cell; useful 
as a vector for gene therapy, 
reverse banding (R-banding) A chromosome 
banding technique in which chromosomes are 
heated in a phosphate buffer; produces dark 
and light bands in patterns that are the reverse 
of those produced by G-banding. 
reverse transcriptase An enzyme that tran¬ 
scribes RNA into DNA (hence “reverse”), 
ribonucleic acid (RNA) A single-stranded 
molecule that consists of a sugar (ribose), a 
phosphate group, and a series of bases 
(adenine, cytosine, guanine, and uracil). There 
are three basic types of RNA: messenger RNA 
(mRNA), ribosomal RNA (rRNA), and trans¬ 
fer RNA (tRNA). 

ribosomal RNA (rRNA) In conjunction with 
protein molecules, composes the ribosome, 
ribosome The site of translation of mature 
messenger RNA into amino acid sequences, 
ribozyme An mRNA molecule that has 
catalytic activity. Some ribozymes can be 
used to cleave mRNA in somatic cell gene 
therapy. 

ring chromosome A structural chromosome 
that is abnormally formed when both ends of 
a chromosome are lost and the new ends fuse 
together. 

RNA interference (RNAi) A method in which 
a specific mRNA sequence is recognized and 
destroyed by a complex of proteins that exist 
naturally in eukaryotic cells. RNAi can be used 
to block the expression of specific genes or, in 
the context of gene therapy, gain-of-function 
mutations. 

RNA polymerase An enzyme that binds to a 
promoter site and synthesizes messenger RNA 
from a DNA template. 

RNA sequencing (RNA-seq) Determination 
of the sequence of an RNA molecule; usually 
carried out with high-throughput sequencing 
technology. See also high-throughput DNA 
sequencing. 

S The DNA replication phase of interphase, 
sarcomas Tumors that originate in cells of 
connective tissue. 

satellite DNA A portion of the DNA that 
differs enough in base composition so that it 
forms a distinct band on a cesium chloride 
gradient centrifugation; usually contains 
highly repetitive DNA sequences, 
secondary oocyte A cell containing 23 dou¬ 
ble-stranded chromosomes, produced from a 
primary oocyte after meiosis I in the female, 
secondary spermatocyte A cell containing 
23 double-stranded chromosomes, produced 
from a primary spermatocyte after meiosis I 
in the male. 

segregation The distribution of genes from 
homologous chromosomes to different 
gametes during meiosis. 
selection coefficient A numerical measure of 
the degree of natural selection against a spe¬ 
cific genotype, typically measured as the 
number of offspring produced by individuals 
who have the genotype, relative to other geno¬ 
types at the locus. A coefficient of zero indi¬ 
cates that there is no selection against a 
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genotype, and a coefficient of one indicates 
lethality of the genotype, 
senescent Aged (e.g., an aged, or senescent, cell), 
sense strand In a double-stranded DNA mol¬ 
ecule, this is the strand from which messenger 
RNA is not transcribed. Because of comple¬ 
mentary base pairing, the sense strand is iden¬ 
tical in sequence to the transcribed mRNA 
(with the exception that mRNA has uracil 
instead of thymine). See antisense strand, 
sensitivity The percentage of affected indi¬ 
viduals who are correctly identified by a test 
(true positives). Compare with specificity, 
sequence (formerly "anomalad") Aprimary 
defect with secondary structural changes in 
development (examples: oligohydramnios 
sequence, Pierre-Robin sequence), 
sequence tagged sites (STSs) DNA 
sequences of several hundred base pairs that 
are fianked by PCR primers. Their chromo¬ 
some location has been established, making 
them useful as indicators of physical positions 
on the genome. 

serious adverse drug reaction A clinically 
significant reaction to therapeutically admin¬ 
istered drug. 

sex chromosomes The X and Y chromo¬ 
somes in humans. Compare with autosomes. 
sex-influenced A trait whose expression is 
modified by the sex of the individual possess¬ 
ing the trait. 

sex-limited A trait that is expressed in only 
one sex. 

sex-linked A trait that is caused by genes on 
the sex chromosomes (X or Y). 
short tandem repeat (STR) A DNA sequence 
that contains multiple repeated short 
sequences, one after another. These sequences 
can be polymorphic because the number of 
repeats varies among individuals, 
signal transduction A process in which bio¬ 
chemical messages are transmitted from the 
cell surface to the nucleus, 
silencer A DNA sequence that binds to spe¬ 
cific transcription factors to decrease or 
repress the activity of certain genes. Compare 
with enhancer. 

silent substitution A DNA sequence change 
that does not change the amino acid sequence 
because of the degeneracy of the genetic code. 
SINEs (short interspersed elements) A class 
of dispersed repetitive DNA in which each 
repeat is relatively short. Compare with LINEs. 
single-copy DNA DNA sequences that occur 
only once in the genome. Compare with 
repetitive DNA. 

single-gene disorder or trait A feature or 
disease that is caused by a single gene. 
Compare with polygenic and multifactorial, 
single nucleotide polymorphisms (SNPs) 

Polymorphisms that result from variation at a 
single nucleotide. Compare with microsatel¬ 
lites and VNTRs. 

sister chromatid exchange Crossover between 
sister chromatids; can occur either in the sister 
chromatids of a tetrad during meiosis or 
between sister chromatids of a duplicated 
somatic chromosome. 


sister chromatids The two identical strands 
of a duplicated chromosome, joined by a 
single centromere. 

solenoid A structure of coiled DNA, consist¬ 
ing of approximately six nucleosomes. 
somatic cell A cell other than those of the 
gamete-forming germline. In humans, most 
somatic cells are diploid, 
somatic cell nuclear transfer The placement 
of a nucleus derived from one somatic cell into 
a different cell (the latter is typically an ovum 
from which the original nucleus has been 
removed). 

somatic hypermutation An extreme increase 
in the mutation rate of somatic cells; observed 
in B lymphocytes as they achieve increased 
binding affinity for a foreign antigen, 
somatic recombination The exchange of 
genetic material between homologous chro¬ 
mosomes during mitosis in somatic cells; 
much rarer than meiotic recombination. 
Southern transfer (also Southern blot) Lab¬ 
oratory procedure in which DNA fragments 
that have been electrophoresed through a gel 
are transferred to a solid membrane, such as 
nitrocellulose. The DNA can then be hybrid¬ 
ized with a labeled probe and exposed to x-ray 
film (an autoradiogram), 
spatial colinearity Correlation between the 
5'-3' arrangement of genes along chromo¬ 
somes and their spatial expression (e.g., ante¬ 
rior to posterior) during development, 
specificity The percentage of unaffected indi¬ 
viduals who are correctly identified by a test 
(true negatives). Compare with sensitivity, 
spectral karyotype A chromosome display 
(karyotype) in which combinations of fluo¬ 
rescent probes are used with special cameras 
and image-processing software so that each 
chromosome has a unique color. 
Spemann-Mangold organizer A group of 
cells in the developing amphibian embryo that 
initiates development of the central nervous 
system. 

sperm Mature male reproductive cells, 
spermatid One of the four haploid cells 
formed from a primary spermatocyte during 
spermatogenesis. Spermatids mature into 
spermatozoa. 

spermatogenesis The process of male gamete 
formation. 

spermatogonia Diploid germline stem cells 
from which sperm cells (spermatozoa) are 
ultimately derived. 

spindle fiber One of the microtubular threads 
that form the spindle in a cell, 
splice site mutation DNA sequence altera¬ 
tions in donor or acceptor sites or in the con¬ 
sensus sites near them. Produces altered intron 
splicing, such that portions of exons are 
deleted or portions of introns are included in 
the mature mRNA transcript, 
sporadic Refers to the occurrence of a disease 
in a family with no apparent genetic transmis¬ 
sion pattern (often the result of a new 
mutation). 

stop codon mRNA base triplets that specify the 
point at which translation of the mRNA ceases. 


submetacentric A chromosome in which the 
centromere is located closer to one end of the 
chromosome arm than the other. Compare 
with metacentric and acrocentric, 
subtelomeric rearrangements Alterations of 
chromosomes, principally deletions and dupli¬ 
cations, that occur close to the telomeres and 
can cause genetic disease, 
synapsis The pairing of homologous chromo¬ 
somes during prophase I of meiosis. 
syndrome A pattern of multiple primary mal¬ 
formations or defects all resulting from a 
single underlying cause (examples: Down syn¬ 
drome, Marfan syndrome), 
syntenic Describes two loci located on the same 
chromosome; they may or may not be linked. 
T lymphocyte or T cell A component of the 
adaptive immune system whose receptors 
bind to a complex of MHC molecule and 
foreign antigen. There are two major classes of 
T lymphocytes, helper T lymphocytes and 
cytotoxic T lymphocytes, 
tandem repeat DNA sequences that occur in 
multiple copies located directly next to one 
another. Compare with dispersed repetitive 
DNA. 

targeted disruption The disabling of a spe¬ 
cific gene so that it is not expressed, 
telomerase A transferase enzyme that replaces 
the DNA sequences in telomeres during cell 
division. 

telomere The tip of a chromosome. 

telophase The final major stage of mitosis and 
meiosis, in which daughter chromosomes are 
located on opposite edges of the cell and new 
nuclear envelopes form. 

template A strand of DNA that serves as the 
model for replication of a new strand. Also 
denotes the DNA strand from which mRNA is 
transcribed. 

temporal colinearity Correlation between 
the 5'-3' arrangement of genes along chromo¬ 
somes and the timing of their expression in 
development. 

teratogen A substance in the environment 
that can cause a birth defect, 
teratology The study of environmental factors 
that cause birth defects or congenital 
malformations. 

termination sequence A DNA sequence that 
signals the cessation of transcription, 
tetrad The set of four homologous chromatids 
(two sister chromatids from each homologous 
chromosome) observed during meiotic pro¬ 
phase I and metaphase I. Synonymous with 
bivalent. 

tetraploidy A polyploid condition in which the 
individual has four copies of each chromosome 
in each cell, for a total of 92 chromosomes, 
threshold of liability A value on the liability 
distribution (see liability distribution) below 
which an individual is unaffected and above 
which an individual is affected by disease, 
thymine One of the four DNA bases (abbre¬ 
viation: T). 

tissue-specific mosaicism A mosaic in 

whom the mosaicism is confined to only spe¬ 
cific tissues of the body. 
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transcription or transcribed The process in 
which an mRNA sequence is synthesized from 
a DNA template. 

transcription activator-like effector nucle¬ 
ase (TALEN) An artificially constructed 
restriction enzyme that consists of a transcrip¬ 
tion activator-like effector (TALE) and a 
nuclease (N). The TALE can be engineered to 
bind to any specific DNA sequence, and the 
nuclease component cleaves the DNA. Because 
they can cleave DNA at precise locations in the 
genome, TALENs are used to edit target DNA 
sequences to create animal models of diseases 
or other phenotypes. 

transcription factor Protein that binds to 
DNA to influence and regulate transcription, 
transcription factor, general A class of tran¬ 
scription factors that are required for the tran¬ 
scription of all structural genes, 
transcription factor, specific A class of tran¬ 
scription factors that activate only specific 
genes at specific points in time, 
transduction The transfer of DNA from one 
cell to another by a vector such as a plasmid 
or bacteriophage. 

transfection The transfer of a DNA sequence 
into a cell. 

transfer RNA (tRNA) A class of RNA that 
helps to assemble a polypeptide chain during 
translation. The anticodon portion of the 
tRNA binds to a complementary mRNA 
codon, and the 3' end of the tRNA molecule 
attaches to a specific amino acid, 
transformation The oncogenic conversion of 
a normal cell to a state of unregulated growth, 
transgenic Refers to an organism into which a 
gene has been introduced from an organism 
of another species (e.g., a transgenic mouse 
could contain an inserted human gene), 
translation The process in which an amino acid 
sequence is assembled according to the pattern 
specified by the mature mRNA transcript, 
translocation The exchange of genetic mate¬ 
rial between nonhomologous chromosomes, 
translocation, reciprocal A translocation 
resulting from breaks on two different 


chromosomes and a subsequent exchange of 
material. Carriers of reciprocal translocations 
maintain the normal number of chromo¬ 
somes and the normal amount of chromo¬ 
some material. 

translocation, Robertsonian A translocation 
in which the long arms of two acrocentric 
chromosomes are fused at the centromere; the 
short arms of each chromosome are lost. The 
translocation carrier has 45 chromosomes 
instead of 46 but is phenotypically normal 
because the short arms contain no essential 
genetic material. 

transposon See mobile element. 

triploidy A polyploid condition in which 
the individual has three copies of each chro¬ 
mosome in each cell, for a total of 69 
chromosomes. 

trisomy An aneuploid condition in which the 
individual has an extra copy of one chromo¬ 
some, for a total of 47 chromosomes in each 
cell. 

true negative Individual who is correctly 
identified by a test as not having a disease. See 
also specificity. 

true positive Individual who is correctly iden¬ 
tified by a test as having a disease. See also 
sensitivity. 

tumor See neoplasm. 

tumorigenesis Formation or production of a 
tumor. 

tumor suppressor gene A gene whose product 
helps to control cell growth and proliferation; 
mutations in tumor suppressors can lead to 
cancer (example: retinoblastoma gene, RBI). 

two-hit model of carcinogenesis A model 
of carcinogenesis in which both copies of a 
gene must be altered before a neoplasm can 
form. 

ultrasonography A technique for fetal visual¬ 
ization in which sound waves are transmitted 
through the fetus and their reflection patterns 
are displayed on a monitor. 

unequal crossover Crossing over between 
improperly aligned DNA sequences; produces 
deletions or duplications of genetic material. 


uninformative mating A mating in which 
linkage phase cannot be established, 
uniparental disomy A condition in which 
two copies of one chromosome are derived 
from a single parent, and no copies are derived 
from the other parent. May be either heterodi¬ 
somy or isodisomy. 

upstream Refers to the orientation of 
sequences along a gene—specifically the 5' 
direction of the coding strand of DNA or the 
RNA molecule. Compare with downstream, 
validity The degree to which a test measures 
accurately what it is intended to measure, 
variable expression A trait in which the same 
genotype can produce phenotypes of varying 
severity or expression (example: neurofibro¬ 
matosis type 1). 

variable number of tandem repeats 
(VNTRs) A type of polymorphism created 
by variations in the number of minisatellite 
repeats in a defined region, 
variable region A component of the immu¬ 
noglobulin molecule that recognizes and 
binds to specific antigens, 
variance A statistical measure of variation in a 
quantity; estimated as the sum of the squared 
differences from the average value, 
variants of unknown significance Varia¬ 
tions in DNA sequence whose functional 
or clinical consequences are not yet 
established. 

vector The vehicle used to carry a DNA insert 
(e.g., phage, plasmid, cosmid, BAG, or YAC). 
X inactivation Process by which genes from one 
X chromosome in each cell of the female embryo 
are rendered transcriptionally inactive. 

X inactivation center The location on the 
X chromosome from which the X inactivation 
signal is transmitted (includes the XIST 
gene). 

X-linked Refers to genes that are located on the 
X chromosome. 

yeast artificial chromosome (YAC) A syn¬ 
thesized yeast chromosome capable of carry¬ 
ing a large DNA insert (up to 1000 kb), 
zygote The diploid fertilized ovum. 
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CHAPTER 2 

1. The mRNA sequence is: 5'-CAG AAG AAA AUU AAC 
AUG UAA-3' (remember that transcription moves along 
the 3'-5' DNA strand, allowing the mRNA to be synthe¬ 
sized in the 5' to 3' direction). This mRNA sequence is 
translated in the 5' to 3' direction to yield the following 
amino acid sequence: Gln-Lys-Lys-lle-Asn-Met-STOR 

2. The genome is the sum total of our genetic material. It is 
composed of 23 pairs of nuclear chromosomes and the 
mitochondrial chromosome. Each chromosome contains 
a number of genes, the basic unit of heredity. Genes are 
composed of one or more exons; exons alternate with 
introns. Exons encode mRNA codons, which consist of 
three nucleotides each. It is important to remember that 
DNA coiling patterns also produce a hierarchy: Chromo¬ 
somes are composed of 100-kb chromatin loops, which 
are in turn composed of solenoids. Each solenoid contains 
approximately six nucleosomes. Each nucleosome con¬ 
tains about 150 DNA base pairs and may or may not 
include coding material. 

3. Approximately 55% of human DNA consists of repetitive 
sequences whose function is largely unknown. Single-copy 
DNA includes protein-coding genes, but it consists mostly 
of extragenic sequences and introns that do not encode 
proteins. Because individual cells have specialized func¬ 
tions, most make only a limited number of protein prod¬ 
ucts. Thus only a small percentage of the cell’s coding 
DNA is transcriptionally active at any given time. This 
activation is controlled by elements such as transcription 
factors, enhancers, and promoters. 

4. Mitosis is the cell division process whereby one diploid cell 
produces two diploid daughter cells. In meiosis, a diploid 
ceU produces haploid cells (gametes). Meiosis produces 
haploid cells because the centromeres are not duplicated 
in meiosis 1 and because there is no replication of DNA in 
the interphase stage between meiosis I and meiosis II. 
Another difference between mitosis and meiosis is that the 
homologous chromosomes form pairs and exchange 
material (crossing over) during meiosis 1. Elomologs do 
not pair during mitosis, and mitotic crossing over is very 
rare. 

5. Each mitotic division doubles the number of cells in the 
developing embryo. Thus the embryo proceeds from 1 to 
2 to 4 to 8 cells, and so on. After n cell divisions, there are 
2" cells. Eor example, after 10 divisions, there are 2'“, or 
1024, cells. We want a value of n that satisfies the simple 
relationship 2" = 10''‘. One way to find our answer is simply 
to plug in values of n until we get lO’^. A more elegant 
approach is to take the common logarithms of both sides 
of this equation, yielding «log(2) = 14log(10). Because the 


common logarithm of 10 is 1, we obtain the relationship 
n = 14/log(2). Thus, n = 46.5. This result, approximately 
46 to 47 ceU divisions, is only an average value. Some cell 
lineages divide more times than others, and many cells are 
replaced as they die. 

6. A total of 400 mature sperm and 100 mature egg cells will 
be produced. Each primary spermatocyte produces four 
mature sperm cells, and each primary oocyte produces 
only one mature egg cell (the other products of meiosis 
are polar bodies, which degenerate). 

CHAPTER 3 

1. Mutation 1 is a nonsense mutation in the fourth codon, 
which produces premature termination of translation. 
Mutation 2 is a frameshift mutation in the third codon, 
and mutation 3 is a missense mutation in the second 
codon. 

2. Transcription mutations generally lower the production of 
a gene product, but often they do not eliminate it com¬ 
pletely. Transcription mutations in the jl-globin gene 
usually produce (I^-thalassemia, a condition in which there 
is some production of (3-globin chains. (3^-thalassemia 
tends to be less severe than ^“-thalassemia. Missense muta¬ 
tions alter only a single amino acid in a polypeptide chain, 
and when they occur in the jl-globin chain they can 
produce [I^-thalassemia. (Elowever, keep in mind that 
sickle cell disease, which is relatively severe, is also caused 
by a missense mutation.) In contrast, frameshift mutations 
alter many or aU codons downstream from the site of the 
mutation, so a large number of amino acids may be 
changed. Frameshifts can also produce a stop codon. Non¬ 
sense mutations produce truncated polypeptides, which 
are often useless (especially if the nonsense mutation 
occurs near the 5' end of the gene, eliminating most of the 
polypeptide chain). Donor and acceptor mutations can 
delete whole exons or large portions of them. This deletion 
can substantially alter the amino acid composition of the 
polypeptide. Nonsense, frameshift, and donor or acceptor 
mutations all tend to produce the more-severe P“- 
thalassemia, in which no (3-globin chains are present. 

3. In thalassemia conditions, one of the chains, a- or 
(I-globin, is reduced in quantity. Most of the harmful con¬ 
sequences are caused by the relative excess of the chain that 
is produced in normal quantity. If both chains are reduced 
in quantity, there may be a rough balance between the two, 
resulting in less accumulation of excess chains. 

4. SNPs are single nucleotide polymorphisms, typically 
detected by microarrays or DNA sequencing. VNTRs 
consist of varying numbers of repeated DNA sequences. 
Because the number of tandem repeats can vary 
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considerably, VNTRs can have many different alleles in 
populations. VNTRs are found in minisatellite regions. 
STRs consist of variations in the number of shorter micro- 
satellite repeats (usually dinucleotides, trinucleotides, and 
tetranucleotides). They are detected with the use of the 
polymerase chain reaction (PCR). SNPs and VNTRs can 
also be detected using PCR, provided that the DNA 
sequences flanking the polymorphism are known. The 
autoradiogram represents an STR. This is indicated by the 
fact that there are multiple alleles (distinguishing it from 
an SNP) and by the fact that the various alleles differ in 
size by only 4 bp (recall from Chapter 2 that the tandem 
repeat units in minisatellite regions are generally 20 to 
70 bp long). 

5. Because the disease mutation destroys a recognition site, 
those who have the disease allele have a longer restriction 
fragment. This fragment migrates more slowly on the gel 
and is seen higher on the autoradiogram. Individual A has 
only the longer fragment and thus has two copies of the 
disease mutation. This individual has tti-antitrypsin defi¬ 
ciency. Individual B has only the short fragment and is 
genetically and physically unaffected. Individual C has 
both fragments and is thus a clinically unaffected 
heterozygote. 

6. In this sample of 100 individuals, there are 88 x 2 HbA 
alleles in the HbA homozygotes and 10 HbA alleles in the 
heterozygotes. There are thus 186 HbA alleles in the popu¬ 
lation. The frequency of HbA, p, is 186/200 = 0.93, and the 
frequency of HbS, q, is 1 - 0.93 = 0.07. The genotype fre¬ 
quencies in the population are 88/100 = 0.88, 10/100 = 
0.10, and 2/100 = 0.02 for the HbA/HbA, HbA/HbS, and 
HbS/HbS genotypes, respectively. Assuming Hardy-Wein¬ 
berg proportions, the expected genotype frequencies are 
given by p^, 2pq, and cf, respectively. This yields expected 
genotype frequencies of (0.93)^ = 0.865, 2 x 0.93 x 0.07 = 
0.130, and (0.07)^ = 0.005, respectively. In this population, 
the observed and expected genotype frequencies are fairly 
similar to each other. 

7. For an autosomal recessive disease, the prevalence 
(1/10,000) equals the recessive genotype frequency, q^. 
Thus the PKU gene frequency, q, is given hy ^/cf = 
a/i/10,000 = 1/100 = 0.01. The carrier frequency is given 
by 2pq, which is approximately 2q, or 0.02 (i.e., 1/50). 

CHAPTER 4 

1. Because this is an autosomal dominant disorder, and 
because affected homozygotes die early in life, the man is 
a heterozygote and has a 50% chance of passing the dis¬ 
ease-causing allele to each of his offspring. The probability 
that all four will be affected is given by the product of each 
probability: (l/2)'‘ = 1/16. The probability that none 
will be affected (1/16) is obtained in exactly the same 
way. 

2. The probability that the offspring will inherit the retino¬ 
blastoma susceptibility allele is 0.50, because familial reti¬ 
noblastoma is an autosomal dominant disease. However, 


we must also consider the penetrance of the disorder. 
The probability of both inheriting the disease-causing 
allele (0.50) and expressing the disease phenotype (0.90) 
is given by multiplying the two probabilities together: 0.90 
X 0.50 = 0.45. 

3. Because the woman’s sister had Tay-Sachs disease, both 
parents must be heterozygous carriers. This means that, at 
birth, one fourth of their offspring will be affected, one 
half wUl be carriers, and one fourth wUl be genetically 
normal. Note, however, that the woman in question is 30 
years old. She cannot possibly be an affected homozygote 
because affected individuals die by age 6 years. There are 
thus three equally likely possibilities: (1) the disease allele 
was inherited from the mother and a normal allele was 
inherited from the father; (2) the disease allele was inher¬ 
ited from the father and a normal allele was inherited from 
the mother; (3) normal alleles were inherited from both 
parents. Because two of these three possibilities lead to the 
carrier state, the woman’s probability of being a heterozy¬ 
gous carrier is 2/3. 

4. Because the mother has neurofibromatosis and can be 
assumed to be a heterozygote (homozygotes for this con¬ 
dition have not been reported), the probability that her 
daughter (the man’s sister) inherits the disease-causing 
allele is 1/2. The probability that the sister transmits the 
disease-causing allele to her daughter is again 1/2, so the 
probability that both events occur is 1/2 x 1/2 = 1/4. If we 
knew that the man’s sister was affected, then the probabil¬ 
ity that his sister’s daughter is affected is simply 1/2. 

5. The probability that the woman’s disease-causing allele is 
transmitted to her offspring is 1/2, and the probability 
that this offspring in turn transmits the disease-causing 
allele to his or her offspring (i.e., the grandchild) is again 
1/2. Thus, the probability that one grandchild has inher¬ 
ited the allele is 1/2 x 1/2, or 1/4. Similarly, the probability 
that the other grandchild has inherited the allele is 1/4. 
The probability that both grandchildren have inherited 
the allele is 1/4 x 1/4 = 1/16. If the grandmother has PKU, 
she must be homozygous for the disease-causing allele. 
Thus, both of her children must be heterozygous carriers 
(probability =1). The probability that one of these indi¬ 
viduals transmits the disease-causing allele to their off¬ 
spring is 1/2. The probability that they both transmit the 
disease-causing allele to their offspring (i.e., that both 
grandchildren are heterozygous carriers) is 1/2 x 1/2 = 
1/4. 

6. The coefficient of relationship is (1/2)*’, or 1/64. This gives 
the probability that the second member of the couple 
carries the PKU allele. The probability that two carriers 
will produce an affected offspring is 1/4. The overall prob¬ 
ability that this couple will produce a baby affected with 
PKU is given by multiplying the probability that the mate 
also carries the allele (1/64) by the probability that the 
couple both transmit the allele to their offspring (1/4): 
1/64 X 1/4 = 1/256. This demonstrates that the probability 
of producing an affected offspring in this consanguineous 
mating is actually quite small. 
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7. The frequency of the heterozygote genotype in the general 
population is given by 2pq, according to the Hardy-Wein- 
berg law. Thus, the frequency of the first genotype in the 
general population is predicted to be 2 x 0.05 x 0.10 = 0.01. 
Similarly, for the second system, the frequency of hetero- 
zygotes in the general population is 2 x 0.07 X 0.02 = 
0.0028. The perpetrator was a homozygote for the third 
system; the frequency of the homozygote genotype in the 
general population is given by p^, or 0.08^ = 0.0064. If we 
can assume independence of the three STR loci in the 
general population, we can then multiply the frequencies 
of the three genotypes together to get the probability that 
a randomly chosen individual in the general population 
would have the same genotypes as the perpetrator. We thus 
multiply 0.01 x 0.0028 x 0.0064 to obtain a probability of 
0.000000179, or 1/5,580,357. This probability can be made 
smaller by typing additional loci. 

8. As in question 7, we assume independence of the four STR 
loci. In this case, we multiply the four allele frequencies 
together to get the probability: 0.05 X 0.01 x 0.01 x 0.02 = 
0.0000001 = 1/10,000,000. Notice a key difference between 
question 8 and question 7: In the paternity case, the father 
has contributed only half of the baby’s genotype at each 
locus, the other half being contributed by the mother. 
Thus, we examine only a single allele for each locus. In 
contrast, the rapist has contributed both alleles of each 
genotype to the evidentiary sample, so we need to know 
the frequency of each genotype in the general population. 
We use the Hardy-Weinberg law to estimate the popula¬ 
tion frequency of each genotype, based on the known 
allele frequencies. 

CHAPTER 5 

1. There are four Barr bodies, always one less than the 
number of X chromosomes. 

2. This is most likely a result of X inactivation. The hetero¬ 
zygotes with muscle weakness are the ones with relatively 
large proportions of active X chromosomes containing 
the mutant allele. 

3. The disease frequency in males is q, and in females it is 
q^. The male-to-female ratio is thus q/q^ or l/q. Thus, as 
q decreases, the male-to-female ratio increases. 

4. Because the male’s grandfather is affected with the disor¬ 
der, his mother must be a carrier. His father is phenotypi- 
cally normal and therefore does not have the disease 
gene. Thus, the male in question has a 50% risk of devel¬ 
oping hemophilia A. His sister’s risk of being a heterozy¬ 
gous carrier is also 50%. Her risk of being affected with 
the disorder is close to zero (barring a new mutation on 
the X chromosome transmitted by her father). 

5. Male-to-male transmission can be observed in autoso¬ 
mal dominant inheritance, but it is not observed in 
X-linked dominant inheritance. Thus, males affected 
with X-linked dominant disorders must always have 
affected mothers, unless a new mutation has occurred. 
Males and females are affected in approximately equal 


proportions in autosomal dominant inheritance, but 
there are twice as many affected females as males in 
X-linked dominant inheritance (unless the disorder is 
lethal prenatally in males, in which case only affected 
females are seen). In X-linked dominant inheritance, all 
of the sons of an affected male are normal, and all of the 
daughters are affected. In X-linked dominant diseases, 
heterozygous females tend to be more mildly affected 
than are hemizygous males. In autosomal dominant 
inheritance, there is usually no difference in severity of 
expression between male and female heterozygotes. 

6. In mitochondrial inheritance, the disease can be inher¬ 
ited only from an affected mother. In contrast to all other 
types of inheritance, no descendants of affected fathers 
can be affected. Note that males with an X-linked reces¬ 
sive disease who mate with normal females cannot trans¬ 
mit the disease to their offspring, but their grandsons can 
be affected with the disease. 

7. Because Becker muscular dystrophy is relatively rare, it is 
reasonable to assume that the woman is a normal homo¬ 
zygote. Thus, she can transmit only normal X chromo¬ 
somes to her offspring. Her male offspring, having 
received a Y chromosome from the father and a normal 
X from the mother, will all be unaffected. All female 
offspring of this mating will receive a mutated X chromo¬ 
some from the father and all will be heterozygous carriers 
of the disorder. 

8. If the man is phenotypically normal, his X chromosome 
cannot carry a Duchenne muscular dystrophy mutation. 
The female will transmit her mutation-carrying chromo¬ 
some to half of her offspring, on average. Thus, half of 
the sons will be affected with the disorder, and half of the 
daughters will be heterozygous carriers. 

9. The boys’ mother must be a heterozygous carrier for a 
factor VIII mutation. Thus, one of the mother’s parents 
must also have carried the mutation. Consequently, 
the probability that the mother’s sister is a carrier is 
1 / 2 . 

10. This is explained by genomic imprinting. For normal 
development, differential expression of genes inherited 
from the father and mother is necessary. If the expression 
pattern from only one parent is inherited, the embryo 
cannot develop normally and dies. The experiment works 
in amphibians because genomic imprinting does not 
occur in these animals. 

CHAPTER 6 

1. Euploid cells have a multiple of 23 chromosomes. 
Haploid (« = 23), diploid (n = 46), and polyploid 
(triploid and tetraploid) cells are all euploid. Aneuploid 
cells do not have a multiple of 23 chromosomes 
and include trisomies (47 chromosomes in a somatic 
cell) and monosomies (45 chromosomes in a somatic 
cell). 

2. Conventional FISH analysis involves the hybridization of 
a fluorescently labeled probe to denatured metaphase 
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chromosomes and is most useful in detecting aneuploidy, 
deletions, and chromosome rearrangements. FISH can 
also be extended to use multiple differently colored 
probes, allowing the detection of several different aneu- 
ploidies simultaneously. Spectral karyotyping is an exten¬ 
sion of FISH in which each chromosome can be visualized 
as a different color. This enables accurate and easy 
characterization of aneuploidy, loss or duplication of 
chromosome material, and (especially) chromosome 
rearrangements such as translocations. CGH is particu¬ 
larly useful in detecting the gain or loss of chromosome 
material, but it cannot detect balanced rearrangements 
of chromosomes, such as reciprocal translocations. This 
is because, despite the rearrangement, the test sample and 
the reference sample each have the same amount of DNA 
from each chromosome region. 

3. A normal egg can be fertilized by two sperm cells (dispermy, 
the most common cause of triploidy). An egg and polar 
body can fuse, creating a diploid egg, which is then fertil¬ 
ized by a normal sperm cell. Diploid sperm or egg cells can 
be created by meiotic failure; subsequent union with a 
haploid gamete would produce a triploid zygote. 

4. The difference in incidence of various chromosome 
abnormalities reflects the fact that embryos and fetuses 
with chromosome abnormalities are spontaneously lost 
during pregnancy. The rate and timing of loss vary 
among different types of chromosome abnormalities. 

5. A karyotype establishes whether the condition is the 
result of a true trisomy or a translocation. If the trisomy 
is a translocation, the recurrence risk in future pregnan¬ 
cies is greatly elevated. A karyotype also helps to establish 
whether the patient is a mosaic. This can help to predict 
and explain the severity of expression of the disorder. 

6. The risk, from lowest risk to highest, is: (1) 25-year-old 
woman with one previous Down syndrome child 
(approximately 1%); (2) 25-year-old male carrier of a 
21/14 translocation (1-2%); (3) 45-year-old woman with 
no family history (approximately 3%); (4) 25-year-old 
female carrier of a 21/14 translocation (10-15%). 

7. Nondisjunction of the X chromosome can occur in both 
meiosis I and meiosis II. If these two nondisjunctions 
occur in the same cell, an ovum with four X chromosomes 
can be produced. If this is fertilized by an X-bearing sperm 
cell, the zygote will have the 49,XXXXX karyotype. 

8. The meiotic error must have occurred in the father, 
because his X chromosome carries the gene for hemo¬ 
philia A. Because the daughter has normal factor VllI 
activity, she must have inherited her single X chromo¬ 
some from her mother. 

9. A loss of genetic material usually produces more severe 
consequences than does a gain of material, so one would 
expect the patient with the deletion—46,XY,del(8p)—to 
be more severely affected than the patient with the 
duplication. 

10. A translocation can place a proto-oncogene near a 
sequence that activates it, producing a cancer-causing 
oncogene. It can also interrupt a tumor-suppressor gene 


(see Chapter 11), inactivating it. Because these genes 
encode tumor-suppressing factors, their inactivation can 
also lead to cancer. 

CHAPTER 7 

1. In consanguineous matings, a higher fraction of genes in 
the offspring are identical by descent. The identical frac¬ 
tion is measured by the coefficient of relationship. The 
carrier frequency of autosomal recessive inborn errors of 
metabolism such as alkaptonuria decreases as the preva¬ 
lence of the disorder diminishes. Thus, the carrier fre¬ 
quency of very rare disorders is very low. When 
alkaptonuria is diagnosed in a child, it is reasonable to 
suspect that parents who might be related are more likely 
to share an individual’s alkaptonuria gene than two indi¬ 
viduals chosen at random from the population. 

2. Many metabolic reactions can proceed in the complete 
absence of an enzyme. For example, a hydroxide ion can 
combine with carbon dioxide to form bicarbonate. Yet 
this reaction occurs much more efficiently in the pres¬ 
ence of a catalyst, in this case the enzyme carbonic anhy- 
drase. Although many reactions in the human body 
would continue in the absence of an enzymatic catalyst, 
they would not do so at a rate high enough to support 
normal metabolism and physiology. 

3. Although most inborn errors of metabolism are rare, in 
aggregate they contribute substantially to the morbidity 
and mortality of children and adults. Additionally, 
understanding the pathogenic basis of rare metabolic 
disorders has the potential to help physicians and scien¬ 
tists understand similar processes that contribute to 
common diseases. For example, by understanding how 
mutations in glucokinase cause hyperglycemia, we might 
better understand the pathogenesis of diabetes mellitus. 

4. Metabolic disorders have been classified in many different 
ways. In Chapter 7, we categorized them by the types of 
metabolic processes that are affected. Some examples 
include carbohydrate metabolism (e.g., galactosemia, 
hereditary fructose intolerance, glycogen storage disor¬ 
ders); amino acid metabolism (e.g., PKU, MSUD, tyrosin- 
emia); lipid metabolism (e.g., MCAD, LCAD); degradative 
pathways (e.g., Hurler syndrome, OTC deficiency, Gaucher 
disease); energy production (e.g., OXPHOS defects); and 
transport systems (e.g., cystinosis, cystinuria). 

5. Mutations in GAL-l-P uridyl transferase are the most 
common cause of galactosemia. However, mutations in 
genes encoding other enzymes necessary for the metabo¬ 
lism of galactose, such as galactokinase and uridine 
diphosphate galactose-4-epimerase, can also result in 
galactosemia. This is an example of genetic heterogeneity. 
That is, indistinguishable phenotypes may be produced 
by mutations in different genes. Other examples of meta¬ 
bolic disorders that are genetically heterogeneous include 
hyperphenylalaninemia, MSUD, and cystinuria. 

6. The prevalence rates for many inborn errors of metabo¬ 
lism vary widely among ethnic groups. In many instances 
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this is due to founder effect and genetic drift (see Chapter 
3). For example, in Finland more than 30 autosomal 
recessive disorders are found at an unusually high preva¬ 
lence compared with closely related populations. A 
similar scenario partly explains the finding of only a 
single or few disease-causing mutations in Ashkenazi 
lews (lysosomal storage disorders), Mennonites (MSUD), 
and French Canadians (tyrosinemia type 1). In other 
cases, it appears that there may be a selective advantage 
for carriers of a recessive allele (remember that heterozy¬ 
gotes for recessive disorders are unaffected). This might 
explain the varying frequency distribution of LAC*P, 
which confers persistent lactase activity and may have 
been advantageous in populations in which dairy prod¬ 
ucts were a valuable nutritional source. 

7. Although the mitochondrial genome is inherited only 
from the mother, most of the proteins in the mitochon¬ 
dria are encoded by genes in the nuclear genome. Thus, 
disorders of mitochondrial fatty acid oxidation are inher¬ 
ited in an autosomal recessive pattern. Given the age of 
this young woman, it is unlikely that she is affected with 
a mitochondrial fatty acid oxidation disorder, so we can 
assume that she either is a heterozygous carrier or is 
homozygous for the normal allele. Consequently, the 
probability that she is a carrier is 0.67. The probability 
that her mate is a carrier is given by the carrier rate for 
MCAD in the general population: 1 in 70 (=0.014). In a 
carrier x carrier mating, there is a 25% chance of having 
an affected child (probability of two gametes bearing 
mutant alleles uniting); then the woman’s risk of having 
an affected child is 0.67 x 0.014 X 0.25 = 0.002 or 1/500. 
Note that her risk of being a carrier (0.67/0.014) is 48 
times higher than in the general population. 

8. Five different enzymes control the urea cycle pathway. 
Deficiencies of four of these enzymes (AS, ASA, arginase, 
and CPS) are inherited in an autosomal recessive pattern. 
OTC deficiency is an X-linked recessive condition. Most 
women who are carriers for an X-linked recessive disor¬ 
der are not symptomatic. However, there are at least two 
explanations for a symptomatic female. First, one X chro¬ 
mosome is normally inactivated (lyonized) at random in 
each somatic cell. Sometimes, inactivation is skewed, 
with more normal than abnormal X chromosomes being 
inactivated, and the woman becomes symptomatic. 
Testing for skewed inactivation is not commonly per¬ 
formed in diagnostic laboratories. A second possible 
explanation is that only a single X chromosome is present 
in each somatic cell (Turner syndrome; see Chapter 6). 
Thus, X inactivation does not take place because each cell 
must contain one functional copy of the X chromosome. 
If this X chromosome contains a mutated gene (e.g., 
OTC), a woman will be symptomatic. The simplest test 
to confirm this diagnosis is a karyotype. 

9. A functioning OXPHOS system is necessary to metabo¬ 
lize pyruvate aerobically. Defects of the OXPHOS system 
cause pyruvate to be metabolized anaerobically into 
lactate, increasing the level of circulating lactate. Thus, an 


elevated concentration of circulating lactate might signal 
the presence of a disorder of energy production. Increased 
levels of lactate are also produced by decreased tissue 
oxygenation due to decreased circulation (e.g., strenuous 
exercise, shock). Often there are other biochemical 
abnormalities that suggest a defect of OXPHOS; however, 
the diagnosis can be challenging. 

10. Polymorphisms in the genes controlling the metabolism 
of drugs (e.g., CYP2D6) can affect a patient’s therapeutic 
response as well as the profile of side effects. Natural 
foods contain thousands of chemical compounds, many 
of which have pharmacological properties similar to, but 
less potent than, contemporary drugs used by health-care 
providers. Thus, polymorphisms in the genes might have 
enabled some groups of hunter-gatherers to use resources 
that other groups found inedible. Over time this might 
have been enough of a selective advantage that the size 
of one group grew more quickly than the size of others. 

CHAPTER 8 

1. The affected male in generation II inherited the disease 
allele and marker allele 1 from his affected father, and he 
inherited a normal allele and marker allele 2 from his 
mother. Therefore, the disease allele must be on the chro¬ 
mosome that contains marker allele 1 in this male (linkage 
phase). Because he married a female who is heterozygous 
for marker alleles 3 and 4, we expect to observe allele 1 in 
the affected offspring under the hypothesis of linkage. 
Individual III-5 has the 2,4 marker genotype but is affected, 
and individual III-7 has the 1,3 genotype but is normal. 
They both represent recombinants. Thus, there are two 
recombinations observed in eight meioses, giving a recom¬ 
bination frequency of 2/8, or 25%. 

2. For marker A, the affected mother in generation II must 
carry allele 2 on the same chromosome as the Huntington 
disease allele. Under the hypothesis that 9 = 0.0, aU of her 
children must also inherit allele 2 if they are affected. 
Marker A shows a recombinant with the disease allele in 
individual III-5 and thus produces a likelihood of zero for 
a recombination frequency of 0.0. The LOD score is -oo; 
(the logarithm of 0). For marker B, the disease allele is on 
the chromosome bearing marker allele 1 in the affected 
mother in generation II. All offspring inheriting allele 1 
also inherit the disease allele, so there are no recombi¬ 
nants. Under the hypothesis that 0 = 0.0, the affected 
mother can transmit only two possible haplotypes: the 
disease allele with marker allele 1 and the normal allele 
with marker allele 2. The probability of each of these 
events is 1/2. Thus, the probability of observing six off¬ 
spring with the marker genotypes shown is (1/2)® = 1/64. 
This is the numerator of the likelihood ratio. Under the 
hypothesis that 0 = 0.5 (no linkage), four possible haplo¬ 
types can be transmitted, each with probability 1/4. The 
probability of observing six children with these haplotypes 
under the hypothesis of no linkage is then (1/4)® = 1/4096. 
This is the denominator of the likelihood ratio. The ratio 
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is then (l/64)/( 1/4096) = 64. The LOD score is given by 
the common logarithm of 64, 1.8. 

3. The table shows a maximum LOD score, 3.5, at a recom¬ 
bination frequency of 10%. Thus, the two loci are most 
likely to be linked at a distance of approximately 10 cM. 
The odds in favor of linkage at this q value, versus nonlink¬ 
age, are 3162 (or 10^'^) to 1. 

4. Linkage phase can be established in both families, and no 
recombinants are observed in either family. Thus, the esti¬ 
mated recombination frequency is 0.0. The LOD score for 
0 = 0.0 in the first family is given by logio(l/2)V(l/4)^ = 
logio(32) = 1.5. In the second family the LOD score for 
0 = 0.0 is logio{l/2)'’/(l/4)‘’ = logio(64) = 1.8. These two 
LOD scores can then be added to obtain an overall LOD 
score of 3.3. 

5. These matings allow us to establish linkage phase in indi¬ 
viduals 11-1 and II-2, the parents of the individuals in 
generation 111. The disease allele is on the same chromo¬ 
some as marker 4 in individual 11-1 and on the same 
chromosome as marker 5 in individual 11-2. Under the 
hypothesis of linkage, we would predict that offspring who 
inherit markers 4 and 5 will be homozygous for the disease 
allele and, thus affected, offspring who inherit either 4 or 
5 will be heterozygous carriers, and offspring who inherit 
neither 4 nor 5 will be homozygous normal. Note a key 
difference between autosomal recessive and autosomal 
dominant pedigrees in estimating recombination frequen¬ 
cies: Here, both parents in generation 11 contribute infor¬ 
mative meioses because both parents carry a disease-causing 
allele. Thus, we can evaluate all 10 meioses contributing 
to generation III for recombination between the disease 
and marker loci. No recombinations are seen in the first 
four offspring; however, individual 111-5 is homozygous 
normal but has inherited allele 5 from his mother. Thus, 
one recombination in five meioses has occurred in the 
mother, and no recombinations in five meioses have 
occurred in the father. One recombination in 10 meioses 
yields a recombination frequency of 1/10 = 10%. 

6. These matings allow us to establish linkage phase in indi¬ 
vidual 11-1. Under the hypothesis of linkage, she carries 
the disease-causing allele (labeled D) on the same chromo¬ 
some as marker allele 1. Her haplotypes are thus Dl/d2. 
Based on her haplotypes and on those of her unaffected 
mate, we can predict that the offspring who inherit the 1,1 
genotype will be affected, and those who inherit the 1,2 
genotype will not be affected. We see that this is the case 
for all but one of the offspring (111-5). This means that the 
likelihood that 0 = 0.0 is zero, so the LOD score for this 
recombination frequency is -oo. To assess the LOD score 
for 0 = 0.1, we consider that the probability that the father 
passes each recombinant haplotype (D2 or dl) is 0/2 = 
0.05. The probability that he transmits each nonrecombi¬ 
nant haplotype (Dl or d2) is (1 - 0)/2 = 0.45 (see Box 8-1 
for the details of this reasoning process). There is one 
recombinant and seven nonrecombinants among the off¬ 
spring. The probability of observing these eight events is 
0.05 X (0.45)^ = 0.00019. This is the numerator of the 


likelihood ratio. For eight offspring, the probability that 
0 = 0.5 is (1/4)® = 0.000015. This is the denominator of 
the ratio. The logarithm of the odds ratio is given by 
logio(12.2) = 1.09. This is the LOD score for 0 = 0.1. 

7. The mating in generation II is uninformative because the 
father is a homozygote for the marker allele. It will be 
necessary to type the family for another closely linked 
marker (preferably one with more alleles) before any risk 
information can be given. At this point, the only risk infor¬ 
mation that can be given is that each child has a 50% risk 
of inheriting the disease gene from the affected father. 

8. Synteny refers to loci that are on the same chromosome. 
Linkage refers to loci that are less than 50 cM apart on a 
chromosome; alleles at such loci tend to be transmitted 
together within families. Linked loci are thus syntenic, 
but syntenic loci are not necessarily linked. Linkage disequi¬ 
librium is the nonrandom association of alleles at linked 
loci, observable when chromosome haplotypes are exam¬ 
ined in populations. Association indicates that two traits 
are observed together in a population more often than 
expected by chance; the traits might or might not be 
genetic. Association thus does not necessarily have any¬ 
thing to do with linkage, unless we are referring to linkage 
disequilibrium. 

9. Linkage disequilibrium can arise when a single disease- 
causing mutation first occurs on a chromosome copy that 
contains specific nearby marker alleles. At first, the muta¬ 
tion wiU be seen only on copies of the chromosome that 
contain the specific marker alleles. On the other hand, if 
there are multiple disease-causing mutations at a locus like 
NFl, they are likely to occur on chromosome copies with 
different marker alleles, and little association will be 
observed between the disease genotype (which actually 
consists of a collection of different mutations at the disease 
locus) and a specific marker allele. 

CHAPTER 9 

1. The class I molecules present peptides at the surfaces of 
nearly all of the body’s cells. The peptide-class I molecule 
complex is recognized by cytotoxic T cells, which kill the 
cell if the class 1 MHC molecule presents foreign peptide. 
The class II MHC molecules also present peptides at the 
cell surface, but only in antigen-presenting cells of the 
immune system (e.g., dendritic cells, macrophages, B 
cells). If the class 11 molecule presents foreign peptides 
derived from an invading microbe, they are bound by the 
receptors of helper T cells; this in turn stimulates appro¬ 
priate B cells to proliferate and to produce antibodies that 
will help to kill the microbes. 

2. Immunoglobulins differ among B cells within individuals, 
so that a large variety of infections can be combated. MHC 
molecules are identical on each cell surface within an 
individual, but they vary a great deal among individuals. 
This interindividual variability may have evolved to 
prevent infectious agents from spreading easily through 
a population. 
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3. T-cell receptors and immunoglobulins are similar in that 
they are both cell-surface receptors that bind to foreign 
peptides as part of the immune response. Diversity in both 
types of molecules is generated by multiple germline 
genes, VDJ recombination, and junctional diversity. They 
differ in that immunoglobulins are secreted into the cir¬ 
culation (as antibodies) and can bind directly to foreign 
peptide, whereas T-cell receptors are not secreted and 
must “see” foreign peptides in conjunction with MHC 
molecules to recognize them. Also, somatic hypermuta¬ 
tion generates diversity in immunoglobulins but not in 
T-cell receptors. 

4. Somatic recombination alone could produce 30 x 6 x 80 
= 14,400 different heavy chains of this class. 

5. The probability that a sibling will be HLA-identical is 0.25. 
The probability that two siblings share one gene or haplo- 
type is 0.50 (the coefficient of relationship for siblings; see 
Chapter 4). Then, the probability that they share two hap- 
lotypes and are HLA-identical is 0.50 x 0.50 = 0.25. 

6. If a homozygous Rh-positive (DD) man mates with an 
Rh-negative woman, all of the offspring will be Rh-posi¬ 
tive heterozygotes (Dd) and incompatible with the mother. 
If the man is an Rh-positive heterozygote (Dd), half of the 
children, on average, will be incompatible Rh-positive het¬ 
erozygotes. However, if the couple is ABO-incompatible, 
this will largely protect against Rh incompatibility. 

CHAPTER 10 

1. Animals such as roundworm, fruit fly, frog, zebra fish, 
chick, and mouse are commonly used to model human 
development. Each of these organisms has a relatively 
short generation time and can be selectively bred, and 
large populations can be maintained in captivity. Further¬ 
more, embryos from each of these animals can be manipu¬ 
lated either in vitro or in vivo using a variety of techniques 
(e.g., surgical ablation or transplantation of cells or tissues, 
ectopic expression of native genes or transgenes, knock¬ 
outs). Of course, each organism has advantages and dis¬ 
advantages that must be considered when choosing an 
appropriate model. In some animals, naturally occurring 
mutant strains have proved to be valuable models of 
human genetic disease. For example, mutations in murine 
Pax6 produce an abnormally small eye. Mutations in the 
human homolog, PAX6, cause hypoplasia or absence of 
the iris. Most of what we understand about axis determi¬ 
nation and pattern formation has been learned from 
studies of nonhuman animal models. 

2. Although identical mutations in a developmental gene can 
produce different birth defects, the defects are usually the 
same within each family. One possible explanation is that 
other genes may be modifying the phenotype differently 
in each family. In other words, the same mutation is occur¬ 
ring in different genetic backgrounds, resulting in differ¬ 
ent phenotypes. To date, only a few examples of different 
human conditions caused by identical mutations are 
known. However, strain-dependent phenotypic effects are 


well described in other organisms such as mice and fruit 
flies. 

3. Transcription factors commonly activate or repress more 
than a single gene. Often they affect the transcription of 
many genes that may be members of different develop¬ 
mental pathways. This maintains developmental flexibility 
and genomic economy. These developmental pathways are 
used to build many different tissues and organs. Thus, a 
mutation in a transcription factor can affect the growth 
and development of many body parts. For example, muta¬ 
tions in TBX5 cause defects of the limbs and the heart in 
a pleiotropic disorder called Holt-Oram syndrome. 

4. Pattern formation is the process by which ordered spatial 
arrangements of differentiated cells create tissues and 
organs. Initially, the general pattern of the animal body 
plan is established. Then, semiautonomous regions are 
formed that will pattern specific organs and appendages. 
Thus, pattern formation requires that complex temporal- 
spatial information be available to different populations 
of cells at different periods during development. This 
information is communicated between cells by signaling 
molecules. Mutations in the genes encoding these signal¬ 
ing molecules can disrupt signaling pathways, leading to 
abnormal pattern formation. For example, Sonic hedge¬ 
hog (SHH) is widely used in many different patterning 
processes, including that of the brain. Mutations in SHH 
can cause failure of the forebrain to divide (holoprosen- 
cephaly). Patterning defects are also produced by muta¬ 
tions in genes encoding transcription factors (e.g., Hox, 
T-box) that are activated in response to signals from other 
cells. 

5. Because of the tight constraints on developmental pro¬ 
grams, the role of some elements in developmental path¬ 
ways can be played by more than one molecule. This is 
known as functional redundancy. Hox paralogs appear to 
act in some developmental pathways in just this manner. 
For example, mice with Hoxall or Hoxdll mutations 
have only minor abnormalities, whereas Hoxall/Hoxdll 
double mutants exhibit a marked reduction in the size of 
the radius and ulna. Thus, Hoxal 1 and Hoxdl 1 may be 
able to partially compensate for each other in some devel¬ 
opmental programs. 

6. There are many reasons why it may be impractical to use 
an organism to study loss-of-function mutations by creat¬ 
ing knockouts. For example, baboons have small nuclear 
families and long generation times and are expensive to 
maintain in captivity. A short generation time is critical, 
because complete loss of function is achieved by back- 
crossing chimeras and subsequently mating heterozygotes 
with a genetic modification to produce animals homozy¬ 
gous for a genetic modification (i.e., a knockout). One way 
to circumvent the problem of long generation times would 
be to produce sperm from the model animal (e.g., baboon) 
in an organism with a short generation time (e.g., mouse). 
In this technique, immature sperm-making cells from an 
infant baboon would be placed in the testicles of mice to 
make mature sperm. These sperm could be used for in 
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vitro fertilization of eggs harvested from female baboons 
with a genetic modification (i.e., chimeras or heterozy¬ 
gotes). In this manner, the generation time would be sub¬ 
stantially reduced. This type of experiment has not yet 
been successfully accomplished, although the technology 
is forthcoming. 

7. Cells communicate with each other through many different 
signaling pathways. Signaling requires that a ligand bind to 
a receptor molecule. This generates a response that might 
activate or inhibit a variety of processes within a cell. Muta¬ 
tions in fibroblast growth factor receptor (FGFR) genes 
cause a variety of craniosynostosis syndromes, and muta¬ 
tions in fibroblast growth factor genes have been associated 
with cleft lip and/or cleft palate. Mutations in the genes 
encoding endothelin-3 and its receptor, endothelin-B, 
cause defects of the enteric cells of the gastrointestinal 
tract, leading to severe, chronic constipation (Hirschsprung 
disease). This demonstrates that a mutation in a ligand or 
its receptor can produce the same phenotype. 

8. There are many obstacles to treating birth defects with 
gene therapy. Many developmental genes are expressed 
early in development. Thus, the phenotype would have to 
be identifiable at a very early age. This is certainly feasible 
(e.g., via preimplantation testing) in the gametes or zygotes 
of a known carrier. Because many of these genes encode 
axis, pattern, or organ-specific information, gene therapy 
would have to be initiated very early in the developmental 
process. In addition, the therapy might have to be targeted 
to specific areas, at critical times during development, and 
at appropriate levels required for interaction with other 
developmental genes. Consequently, it will be challenging 
to develop strategies that may be useful for treating birth 
defects with gene therapy. 

CHAPTER 11 

1. G6PD is on a portion of the X chromosome that is inac¬ 
tivated in one copy in normal females. Thus, any single cell 
will express only one G6PD allele. If all tumor cells express 
the same G6PD allele, this indicates that they all arose 
from a single ancestral ceU. This evidence was used to 
support the theory that most tumors are monoclonal. 

2. The probability that a single cell will experience two muta¬ 
tions is given by the square of the mutation rate per cell 
(i.e., the probability of one mutation and a second muta¬ 
tion in the same cell): (3 x 10“*’)^ = 10“". We then multiply 
this probability by the number of retinoblasts to obtain the 
probability that an individual will develop sporadic retino¬ 
blastoma: 10^“ X 2 X 10*’ = 2 X 10“^. We would thus expect 

2 in 100,000 individuals to develop sporadic retinoblas¬ 
toma, which is consistent with observed prevalence data 
(i.e., retinoblastoma is seen in about 1/20,000 children, and 
about half of these cases are sporadic). If an individual has 
inherited one copy of a mutant retinoblastoma gene, then 
the number of tumors is given by the rate of somatic muta¬ 
tion (second hit) per ceU times the number of target cells: 

3 X 10“'’ X 2 X 10'’ = 6 tumors per individual. 


3. Oncogenes are produced when proto-oncogenes, which 
encode substances that affect cell growth, are altered. 
Oncogenes usually act as dominant genes at the level of 
the cell and help to produce transformation of a normal 
cell into one that can give rise to a tumor. Tumor suppres¬ 
sor genes are also involved in growth regulation, but they 
usually act as recessive genes at the level of the cell (i.e., 
both copies of the gene must be altered before progression 
to a tumor can proceed). Because only a single oncogene 
need be altered to initiate the transformation process, 
oncogenes have been detected by using transfection and 
retroviral assays and by observing the effects of chromo¬ 
some translocations. Such methods are less effective for 
uncovering tumor suppressors, because two altered copies 
of these genes must be present before their effects can be 
observed in a cell. As a consequence, most tumor suppres¬ 
sor genes have been detected by studying relatively rare 
cancer syndromes in which one mutant copy of the tumor 
suppressor is inherited and the second alteration occurs 
during somatic development. 

4. Li-Fraumeni syndrome is caused by the inheritance of a 
mutation in the TP53 gene. The inherited mutation is 
present in all cells and greatly increases predisposition 
to tumor formation. However, because of the multistep 
nature of carcinogenesis, this inherited event is not suffi¬ 
cient to produce a tumor. Other events, occurring in 
somatic cells, must also take place. These somatic events 
are rare, so the probability of their occurrence in any given 
cell is small, explaining the low frequency of any specific 
tumor type. The involvement of many different tumor 
types in Li-Fraumeni syndrome is explained by the fact 
that normal p53 activity is required for growth regulation 
in many different tissues. Thus, the likelihood that an indi¬ 
vidual will develop at least one primary tumor, in one of 
many tissues, is very high. 

CHAPTER 12 

1. Because the trait is more common in females than in 
males, we infer that the threshold is lower in females 
than in males. Thus, an affected father is at greater risk 
for producing affected offspring than is an affected 
mother. The recurrence risk is higher in daughters than 
in sons. 

2. For a multifactorial trait, the recurrence risk decreases 
rapidly in more remote relatives of a proband (as shown 
in Table 12-2). In contrast, for an autosomal dominant 
gene, the recurrence risk is 50% smaller with each degree 
of relationship, reflecting the coefficient of relationship 
(see Chapter 4). Recall that first-degree relatives (parents, 
offspring, and siblings) share 50% of their DNA because 
of descent from common ancestors, second-degree rela¬ 
tives (uncles and nieces, grandparents and grandchildren) 
share 25% of their DNA, and so on. Thus, for a disease- 
causing genotype with 10% penetrance, the recurrence 
risk is obtained by multiplying the penetrance times the 
percentage of shared DNA between relatives: 5% (10% x 
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50%) for first-degree relatives, 2.5% (10% x 25%) for 
second-degree relatives, 1.25% (10% x 12.5%) for third- 
degree relatives, and so on. In addition, if the disease is 
multifactorial, the recurrence risk should increase in pop¬ 
ulations in which the disease is more common. There is 
typically no relationship between disease frequency and 
recurrence risk for an autosomal dominant disease. 

3. (1) The disease-causing genotype may have reduced pen¬ 
etrance, due, for example, to environmental factors. (2) A 
somatic mutation might have occurred after cleavage of 
the embryo, such that one twin is affected by the disorder 
and the other is not. 

4. These results imply that shared environmental factors are 
increasing the correlations among siblings, because sib¬ 
lings tend to share a more common environment than do 
parents and offspring. The spouse correlation reinforces 
the interpretation of a shared environment effect, although 
it is also possible that persons with similar body fat levels 
marry preferentially. 

CHAPTER 13 

1. The sensitivity of the test is 93% (93 of the 100 true disease 
cases were detected). The specificity is 99% (98,900 of 
99,900 unaffected neonates were correctly identified). The 
positive predictive value is 8.5% (93 of the 1093 neonates 
with positive tests actually had the disease). The false¬ 
positive rate is 1% (1 - specificity, or 1000 of 99,900), and 
the false-negative rate is 7% (1 - sensitivity, or 7 of 100). 

2. Because individual 3 is homozygous for the upper allele, 
we infer that the disease gene is on the same chromosome 
as the upper allele in both parents. Thus, individual 6, who 
inherited both copies of the upper allele, also inherited 
both copies of the PKU disease gene and is affected. 

3. The NPl disease gene is on the same chromosome as allele 

1 in the affected father. Thus, individual 6, who inherited 
allele 2 from his father, should be unaffected. Note that 
our degree of confidence in the answers to both question 

2 and question 3 depends on how closely linked the marker 
and disease loci are. 

4. The mating in generation I is uninformative, so we cannot 
establish linkage phase in the female in generation II. 
Thus, the risk estimate for her daughter cannot be 
improved from the usual 50% figure used for autosomal 
dominant disease genes. Diagnostic accuracy could be 
improved by assaying another, more polymorphic marker 
(e.g., an STR, which would more likely permit the accurate 
definition of linkage phase). 

5. The primary advantages of amniocentesis are a lower rate 
of fetal loss (approximately 0.2% to 0.3% versus 1% to 
1.3% for CVS) and the ability to do an AFP assay to detect 
neural tube defects. CVS offers the advantages of diagnosis 
earlier in the pregnancy and a more rapid laboratory diag¬ 
nosis. The CVS diagnosis may be complicated by confined 
placental mosaicism, and there is limited evidence for an 
association between early (before 10 weeks post-LMP) 
CVS and limb reduction defects. 


6. Huntington disease (HD) is a poor candidate for gene 
replacement therapy because it is caused at least in part by 
a gain-of-function mutation. Thus, simple gene insertion is 
unlikely to correct the disorder. It would be a better candi¬ 
date for antisense, ribozyme, or RNAi treatment, in which 
the defective gene product would be disabled. An addi¬ 
tional consideration is that this disorder affects primarily 
neurons, which are relatively difficult to manipulate or 
target. The fact that HD has a delayed age of onset, however, 
is encouraging because identification of the action of the 
HD gene might lead to drug therapy to block the gene 
product’s effects before neuronal damage occurs. 

CHAPTER 14 

1. Genetic information can be used to assess a person’s risk 
of disease and response to therapy. Persons at high risk for 
a disease could therefore be encouraged to alter their 
health management to reduce their risk. Similarly, genetic 
information can be used to predict a person’s response to 
therapy and whether the person is at increased risk for a 
serious adverse drug response. Drugs predicted to be less 
effective or likely to cause an adverse event could be 
avoided. For example, a person predicted to be at high risk 
for diabetes might be tested for hyperglycemia more fre¬ 
quently, placed on a medical diet earlier, and treated more 
aggressively for glucose intolerance. 

2. Persons with the same type of cancer might respond dif¬ 
ferently to therapy because the genetic abnormalities in 
their tumors are different or because of differences, for 
example, in the way that chemotherapeutic drugs are 
metabolized. 

3. Race has traditionally been used to categorize large groups 
of individuals and reflects geographic origin, language, 
and various cultural attributes that describe a group (e.g.. 
Native Americans or South Asians). Ancestry refers to the 
geographic, historical, or biological origins of one’s ances¬ 
tors and, for any individual, can be complex. 

4. Genetic information about a person’s ancestry can influ¬ 
ence the person’s perception of his or her biological and/ 
or cultural identity. For example, some people who self- 
identify as African American have sought to find the spe¬ 
cific geographic regions of sub-Saharan Africa where some 
of their ancestors once lived. In some cases, this informa¬ 
tion suggests specific populations, and therefore cultural 
links, with which an individual might identify. On the 
other hand, genetic information sometimes suggests that 
a person has little, if any, biological ancestry from the 
populations with which he or she self-identifies. Ulti¬ 
mately, every person is a member of many different popu¬ 
lations and has multiple identities—social, economic, 
religious—and genetic ancestry information provides 
little insight about who they are but provides, instead, 
some information about where they came from. 

5. Examples of polymorphisms that influence drug metabo¬ 
lism include variants of CYP2C9 and VKORCl that influ¬ 
ence the metabolism of warfarin, an anticoagulant drug; 
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CYP2D6 variants that affect the biotransformation 
of P-adrenergic receptor antagonists, neuroleptics, and 
tricyclic antidepressants; NAT2 variants that affect the 
inactivation of isoniazid, a drug commonly used to treat 
tuberculosis; and G6PD that influences sensitivity to the 
antimalarial drug, primaquine. Response to antihyperten¬ 
sive P-blockers has been associated with variants in genes 
that encode subunits of the P-adrenergic receptor. 

6. Potential obstacles to the use of genetic information in 
personalized medicine include an inability to identify 
genetic and environmental risk factors (and their interac¬ 
tions) that enable accurate prediction of clinically signifi¬ 
cant risk; lack of evidence demonstrating that individual 
risk assessment improves diagnostic accuracy and treat¬ 
ment outcome; lack of technologies for cost-efficient 
assessment of an individual’s genome; building an infra¬ 
structure for clinicians to access risk data, interpret risk 
information, and explain risk estimates to patients; 
and the need for guidelines and policies for how risk- 
assessment information should be used in clinical and 
research applications. 

7. The use of individual genetic variants to predict risk of 
disease and/or response to pharmacologic agents can be 
considered the practice of genetic medicine, whereas the 
assessment of the action of many genes simultaneously to 
predict disease risk or drug response distinguishes genomic 
medicine. 

8. Potential uses of whole-genome data from an individual 
include screening for inborn errors, metabolism in new¬ 
borns (i.e., newborn screening), testing for carriers of 
genetic disorders (e.g., sickle cell disease, cystic fibrosis), 
assessing risk for common diseases, predicting drugs that 
might influence risk for a serious adverse drug, and foren¬ 
sic identification. 


CHAPTER 15 

1. Allen’s family includes several members who have had 
myocardial infarctions at a relatively young age. The pedi¬ 
gree suggests that an autosomal dominant gene predispos¬ 
ing family members to heart disease may be segregating in 
this family. This could be caused by autosomal dominant 
familial hypercholesterolemia, or possibly by another dis¬ 
order of lipid metabolism. Allen should be encouraged to 
have his serum lipid levels tested (total cholesterol, LDL, 
HDL, and triglycerides). If his LDL level is abnormally 
high, intervention may be necessary (e.g., dietary modifi¬ 
cation, cholesterol-lowering drugs). 



2. Based on the fact that Mary’s two brothers and one uncle 
were all affected, we can be quite certain that her mother 
is a carrier of the DMD disease-causing mutation. (If only 
one of Mary’s brothers had been affected, we would have 
to consider the possibility that he was the product of a new 
mutation.) If Mary’s mother is a carrier, then there is a 
probability of 1/2 that Mary is a carrier. As we saw in 
Chapter 5, a female carrier will transmit the disease gene 
to half of her sons, on average. Thus, the probability that 
one of her sons will be affected with DMD is l/2x 1/2 = 
1/4. The creatine kinase test contributes additional infor¬ 
mation. We can set up the Bayesian calculation as follows: 



Mary Is 

Mary Is T 


a Carrier 

a Carrier 

Prior probability 

1/2 

1/2 

Conditional probability 

2/3 

0.05 

that her CK is in 



the 95th percentile 



Joint probability 

1/3 

0.025 

Posterior probability 

0.93 

0.07 


It should be clear that the conditional probability of 
being a carrier, given a CK level above the 95th percentile, 
is 2/3. The conditional probability of not being a carrier 
with this CK level must be 0.05, because only 5% of CK 
values in normal persons fall above the 95th percentile. 
Thus, the information derived from the CK test has 
increased Mary’s probability of being a carrier from 1/2 to 
0.93. Because the probability that she would transmit the 
DMD gene to her male offspring is 1/2, the probability of 
producing an affected male increases from 0.25 to 0.47. 
Currently, it is likely that additional tests, such as screening 
for DMD mutations and performing a dystrophin assay, 
would yield even more precise information. 

3. The prior probability that Bob has inherited the mutation 
from his father is 1/2. Because 85% of gene carriers mani¬ 
fest symptoms by age 51 if they inherited the mutation from 
an affected father, the conditional probability that Bob is 51 
years old and unaffected, given that he inherited the gene, 
is 0.15. The probabilities can be set up as in the table below. 
In this example, the incorporation of age-of-onset infor¬ 
mation decreased Bob’s chance of inheriting the disease 
gene from 50% to only 13%. With the identification of the 
Huntington disease gene. Bob would likely be given a DNA 
diagnostic test to determine with certainty whether he had 
inherited an expanded repeat mutation from his father. 



Bob 

Bob Does 


Carries 

Not Carry 


the HD 

the HD 


Mutation 

Mutation 

Prior probability 

1/2 

1/2 

Conditional probability 

0.15 

1 

that Bob is normal at 



age 51 years 



Joint probability 

0.075 

1/2 

Posterior probability 

0.13 

0.87 
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tti-Antitrypsin deficiency, 2f, 175t-176f, 
272-273, 272t, 273b, 283 
APC gene, 222f, 230-233 
APCs. Antigen-presenting cells (APCs) 
Apert syndrome, 200/7-202/7, 202/ 202f 
Apical ectodermal ridge (AER), 213, 214/ 
Apnea, 113 
Apolipoproteins, 249 
Apoptosis, 217, 218/ 

TP53 gene and, 229 
Appendages, formation of, 208 
Arachnodactyly, 75/ 

Marfan syndrome and, 75 
Argininosuccinase, 147/ 
deficiency of, 148 

Argininosuccinic acid synthetase, 147/ 
deficiency of, 134f, 148 

Array comparative genomic hybridization, 107, 
108/ 

Arthropathy, 151 




Artificial chromosomes 
as cloning vector, 44:b-4:5b 
human, 285 

in Human Genome Project, 169^7 
Ashkenazi Jewish population, 57-58, 268-269, 
269t 

ASOs. See Allele-specific oligonucleotides 
(ASOs) 

Aspiration pneumonia, 113 
Association studies, genome-wide, 172-173, 
172/-173/, 248/7 
Associations 

in populations, 164-166 
whole-genome consumer services and, 295b 
Asthma, 287, 292, 296b 
Ataxia, 92-93 

Ataxia telangiectasia, 38f, 175f-176f, 195f, 222t, 
234 

Atherosclerosis, 38f, 247-249 
ATM gene, 222f, 234 
ATP7A gene, 151 
ATP7B gene, 151-152 
Autism, 174, 245t 
Autism spectrum disorder, 261-262 
Autoimmune diseases, 257 
type 1 diabetes as, 257 

Autoimmune disorders, expression profiles in, 
298 

Autoimmunity, 191-192 
Automated DNA sequencing, 50, 52/ 
Autoradiogram, 42-45, 45/ 59/ 

Autosomal disorders, 118f 
Autosomal dominant breast cancer, 270, 271/ 
274b 

Autosomal dominant disease, prevalence of, 4f 
Autosomal dominant inheritance, 63-64 
cautions about, 66-67 
characteristics of, 63-64 
major attributes of, 66t 
recurrence risks of, 64 
severity in homozygotes, 66-67 
Autosomal dominant nocturnal frontal lobe 
epilepsy, 253f-254t 
Autosomal recessive disease 

inbreeding and. See Consanguinity 
metabolic, 133 
prevalence of, 4t 

Autosomal recessive inheritance, 64-67 
cautions about, 66-67 
characteristics of, 65-66 
detectability in heterozygotes, 67 
major attributes of, 66t 
recurrence risks of, 66 
Autosomes, 6 

Axis specification, 198-199, 208 
Azathioprine, 294f 

B 

B-cell immunodeficiency disease, 194 
B-cell response, 181-182, 183/ 

B lymphocytes, 180-181, 181/ 

Bacterial artificial chromosomes, 44b-45b, 169/7 
Bacteriophage PI artificial chromosomes, 169/7 
Bacteriophages, 44b-45b 
Balanced chromosome rearrangements, 118 
Balanced rearrangements, of chromosome, 

118f 

Baldness, male-pattern, 91 
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Banding, of chromosomes, 104-105, 106/ 

Bands, of autoradiogram, 42-45, 45/ 
Bardet-Biedl syndrome, 163/7 
Bare lymphocyte syndrome, 194, 195t 
Barr bodies, 80 

Basal transcription factors, 13, 13/ 

Base analogs, 39 

Base-pair substitution, 28-29, 30/ 

Bases 

in DNA, 6, 7/-8/ 

complementary pairing of, 8 
in RNA, 11 

Bayes theorem, 305/7-306/7, 305/-306/, 305f-306f 
Beare-Stevenson syndrome, 202t 
Becker muscular dystrophy, 84, 175M76f, 
286-287 

Beckwith-Wiedemann syndrome, 95-96, 96/, 
302/7, 302/ 

Bendectin, 516b 

Benign neonatal epilepsy, 253f-254f 
Benign neoplasm, 217 
Beta blockers, 293-294 
P sheets, 14t 
Bioethics, 317-318 

Biotin-responsive multiple carboxylase 
deficiency, 282 
Bipolar affective disorder, 2t 
Bipolar disorder, 261-262 
Birth defects, 197. See also Chromosome 

abnormalities; Congenital malformations; 
Metabolic disorders 
Bivalent chromosomes, 23 
“Black urine disease,” 132 
Blastocyst 

inner-cell derived from, 289 
preimplantation genetic diagnosis in, 281 
Blastomere, preimplantation genetic diagnosis 
in, 281 

Blindness, 268-269 
Blood groups, 40-41 

Bloom syndrome, 38f, 175f-176f, 194, 509b 
Blue cone monochromacy, 86 
Body plan, 197, 208 
Bone dysplasia, 510b 

Bone marrow transplantation, for sickle cell 
disease or p-thalassemia, 35 
Bone morphogenetic protein (BMP) family, 
208,215 

Bone tumors, malignant, 70 
Bowel tumors, proximal, 38f 
Brachydactyly, 199-202 
Brain cancer, 297 
BRCAl gene, 222t 
mutations, 234 
screening for, 270 
BRCA2 gene, 222f 
mutations, 234 
screening for, 270 
Breakpoints, translocation, 129 
Breast cancer, 129r, 218-219, 222f, 225, 230, 

233-234, 235/ 243-244, 248b, 256, 262-263, 
266/7, 270, 273-274, 274b 
in colorectal cancer cases, 256 
developing hereditary, 256 
ERBB2 oncogene, 225 
expression profiles in, 297 
familial, 30, 175t-176t, 272t, 511b 
aggregation of, 256 


Breast cancer (Continued) 

frequencies in dissimilar populations, 

218-219 

increased risk of, 256 
inherited, 233-234 
invasive, 256 
in older persons, 4 
relative risk of, 235/ 

Brittle bone disease. See Osteogenesis 
imperfecta 

Burkitt lymphoma, 129, 129f 

vs. diffuse large B-cell lymphoma, 297 

c 

Cafe-au-lait spots, 72-73, 505b, 303/ 

“Calico” cat, 79-80 
Campomelic dysplasia, 204 
Cancer, 217,251b. See also Cancer genes; 
Carcinogenesis 
causes of, 217-219 
cell immortality, 225-226 
environmental considerations in, 218-219 
familial colorectal, 218/ 
genetic inheritance in, 237 
genetics, 1, 3, 217-238 
inherited predispositions, 218 
as multifactorial disorders, 255-256, 262 
Cancer cytogenetics, 128-130, 129f 
Cancer genes, 219-221. See also DNA repair; 
Oncogenes; Tumor suppressor genes 
cell regulation and, 219-220, 219/ 
inherited vs. somatically altered, 220-221, 
221 / 

major classes of, 221-226 
Cancer genomics, 297-298, 297/ 

Cancer-causing genes, 221, 226-227 
breast, 234 

identification of inherited, 226-237 
Carbamazepine, as teratogen, 315f 
Carbamyl phosphate synthetase, 147/ 
deficiency of, 134f, 148 
Carbimazole, as teratogen, 515t 
Carbohydrate metabolism, disorder of, 133-138, 
134f 

Carcinogenesis, 217 

multi-hit concept of, 220 
two-hit model of, 220 
Carcinogens, 217 
Carcinomas, 217 

Cardiac arrhythmias, 96, 273-274 
Cardiofaciocutaneous syndrome (CFC), 

202-203 

Cardiomyopathy, 141, 151, 249, 273-274 
Cardiorespiratory failure, Huntington disease 
and, 71 

Cardiovascular disorders, 247-255 
Cartilage-derived morphogenetic protein 1 
(CDMPl), 199-202 
Cataracts, 38f, 96, 135-136 
p-Catenin, 230-233 
C-banding, 104 

CCR5 (chemokine receptor 5), 191b, 288 

CDHl gene, 222f 

CDK inhibitors, 223, 223/ 

CDK4 gene, familial melanoma and, 235 
CDKN2A gene, 222t 
familial melanoma and, 235-236 
Celiac disease, 192f 
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Cell, anatomy of, 7/ 

Cell biology, 6-27 
Cell cycle, 22-26 

control of, by tumor suppressors, 70 
regulation of, 223/ 

Cell growth and differentiation 
genetic control of, 219-220, 219/ 
proto-oncogenes and, 223 
Cellular immune system, 180-184, 185/ 
comparison with, 184-185 
Centimorgans (cM), 155-157 
Central nervous system anomalies, 315t 
Central nervous system defects, 315f 
Centrioles, 23, 24/ 

Centromeres, 23, 24/ 26/ 
satellite DNA in, 21 
Cerebellar ataxia, 38r 
Cerebral edema, 135, 141 
Cerebral hemorrhage, 315f 
Cervical cancer, 229 

human papilloma virus and, 229 
Cervical ridges, 315f 
CFTR gene, 61^7-62^?, 62/ 

CG dinucleotides, methylated, 39 
Rett syndrome and, 90 
on X chromosome, 81 
CG islands, 168 
Chain reaction, 49 
Charcot-Marie-Tooth disease, 29, 31 
Chediak-Higashi syndrome, 195f 
CHEK2 gene, 222f, 229, 234/ 

Chemical mutagens, 39 
Chiasmata (chiasma), 23-24, 25/-26/ 

Childhood blindness, 284f 
Chloride ion transport defect, 61-62 
Choanal atresia, 315f 
Cholesterol, 141-142 

pattern formation and, 206 
Smith-Lemli-Opitz syndrome and, 134t, 
141-142, 142/ 206 

as steroid hormone precursor, 142, 143/ 
Chordin, 206, 208 
Chorea, 69 

Chorionic villus sampling (CVS), 277-278, 277/ 
inborn errors of metabolism and, 278, 278f 
Christmas disease, 83 
Chromatin, 6, 9/ 
gene activity and, 14 

Chromatin condensation, imprinting and, 96 
Chromatin loops, 7-8, 9/ 

Chromosome(s), 6, 7/ 
banding of, 104-105, 106/ 
comparative genomic hybridization with, 
107-108, 108/ 

fluorescence in situ hybridization with, 
105-107, 106/ 

morphology of, physical mapping from, 166 
nomenclature for, 105f 
number of 

abnormalities of, 108-117 
in human cell, 3 
structure and function of, 6-27 
Chromosome abnormalities, 103 
childhood deaths attributable to, 4f 
clinical phenotypes and, 127-128 
nomenclature for chromosome karyotypes of, 
I05t 

pregnancy loss and, 117-118 


Chromosome abnormalities (Continued) 
prevalence of, 2f, 4f, 118f 
in structure, 118-127. See also Deletions; 
Duplications; Translocations 
inversions, 126-127, 127/ 
isochromosomes, 127, 128/ 
ring, 126, 126/ 

subtelomeric rearrangements, 123-125, 
125/ 

uniparental disomy, 125-126, 125/ 
Chromosome analysis, indications for, 129^ 
Chromosome breakage, 118 
Chromosome disorders, 3 
meiosis and, 26 

Chromosome instability syndromes, 130 
Chromosome integrity, in cancer, 225 
Chronic granulomatous disease (CGD), 194, 
195t, 278, 284^7 

Chronic myelogenous leukemia (CML), 
128-129, 129f, 287 
Chronic nephrolithiasis, 149 
Chronic progressive external ophthalmoplegia 
(CPEO), 93 
Clarithromycin, 294t 
Class switching, 185-186 
Clastogens, 118 
Cleft lip, 2t, 3, 279b, 313 
Cleft palate, 2t, 3, 241, 247f, 313 
with ankyloglossia, 88f 
Cleidocranial dysplasia, 98t-99t 
Clinical cytogenetics, 103-131 
Clinical genetics, 301-320. See also Genetic 
counseling 

common indications for referral, 312^ 
definition of, 301 

evaluation and services in, 309-312 
principles and practice of, 301-312 
types of services and programs, 311^ 

Cloning, 44^-45^ 

and embryonic stem cells, 289, 318 
functional, 167-168 
in Human Genome Project, I69b 
physical mapping and, 166-172 
positional, 167-168 
reproductive, 289, 289/ 318 
Club foot, 2t, 245f, 247f, 276-277 
Co-activators, 13, 13f 
Coagulation factors, stroke and, 252-254 
Coated pits, 251/ 

Cocaine, as teratogen, 315t 
Cockayne syndrome, 38f 
Codeine, 294t 
Codominant alleles, 55 
Codons, 15, 16r 

Coefficient of relationship, 74, 76h-77b, 76/ 78/ 
Coenzyme A, 147/ 

Col2Al gene, 204 
Collagen 

in embryonic development, 204 
type I, 18-19 

osteogenesis imperfecta and, 18^-19^ 
posttranslational modifications of, 19 
Colon cancer, 30, 266b, 273-274, 296b, 297, 
310-311 
familial, 270 
in older persons, 4 
pathway of, 230/ 
screening for syndromes of, 271 


Color vision, 86b-88b 

Colorectal cancer, 220, 229, 232b, 256, 273-274, 
274^7 

breast cancer, 256 
familial, 218/ 256, 263, 272t 
family history of, 275 
hereditary nonpolyposis, 230, 256 
in HNPCC, 233 
inherited cause of, 256 
migrant Japanese populations, 219 
risk of, 256 

Common disease, precision medicine for, 
296-298 

Comparative genomic hybridization (CGH), 
107-108, 108/ 

Complement system, 180 
defects of, 194, 195r 
Complementary base pairing 
in DNA replication, 8 
mRNA and, 12 

Complementary DNA (cDNA) library, 1746 
Compound heterozygotes, 35, 36/ 

Concordance rates, 245, 245f 
Concordant twins, 244-245 
Conditional knockout, 2096-2116 
Conditional probability, 305 
Conductive hearing loss, 112, 1126 
Cone cells, 86 

Congenital adrenal hyperplasia (CAH), 142-143, 
144/ 2666 

fetal treatment of, 282 
screening for, 267, 268t 
Congenital anomaly, common multiple, 

312-313, 3\2t 

Congenital bilateral absence of vas deferens 
(CBAVD), 62 

Congenital contractual arachnodactyly, 75 
Congenital hearing loss, 266b 
Congenital heart defects, 2t, 57, 110, 112-113, 
247t 

Congenital malformations 
cause of, 312 

isolated and syndromic, 313 
of multifactorial diseases, 247, 247f 
prevalence of, 2t, 4t 
rates, 247f 
prevention of, 314 
Congenital myotonic dystrophy, 313 
Congestive heart failure, Marfan syndrome and, 
75 

Conotruncal heart defects, 315f 
Consanguinity 

in autosomal recessive disease pedigree, 

65-66 

health consequences of, 74-77 
in human populations, 74-77 
measurement of, 766-776, 76/ 
mortality levels associated with, 77t 
and recessive disease frequency, 74 
Consensus sequences, 20 
Conserved sequences, 168 
Constant region, of immunoglobulin chain, 

186, 186/ 

Constitutional mutation, 220 
Constitutive heterochromatin, 104 
Contiguous gene syndrome, 122-123 
Cooley’s anemia, 34 
Copper, 149-152 




Copy number variants, 46-48, 48/ 
Cordocentesis, 278 
Cori disease, 136/ 138f 
Cornelia de Lange syndrome, 312f 
Coronary artery disease (CAD), 247-249, 
25017-25217,310-311 
gene therapy for, 288 
Correlation coefficients, 245 
Cosmids, 44^-45^ 

Costello syndrome, 202-203, 203/ 
Costimulatory molecules, 181-182, 184/ 
Cowden disease, 234 
Craniofacial anomalies, 315f 
Craniofacial defects, 315f 
Craniofacial development, 208-213 
Craniofacial dysmorphology, 315t 
Craniosynostosis syndromes, 200^7-202^, 202f, 
213,315f 

Creatine kinase (CK), Duchenne muscular 
dystrophy and, 84, 267/ 

Cri-du-chat syndrome, 120 
CRISPR/Cas system, 209b~211b 
Crossover, 23, 26/ 
double, 154, 155/ 
factors influencing rate of, 156-157 
recombination in, 154-155, 155/-156/ 
unequal, 87, 87/ 

Cross-reaction, autoimmune, 191-192 

Crouzon syndrome, 200^7-202^7, 202r 

Cryptic splice sites, 29-30 

CTLA4 gene, 257 

Cutis aplasia, 113-114 

Cutis laxa, 315f 

Cyclin-dependent kinases (CDKs), 23 
oncogenes, 224t 
tumor suppressor genes, 222 
Cyclins, 23 

CYP2D6, 294-295, 294r 

Cystic fibrosis, 2t, 3, 4/ 61b-62b, 61/-62/, 

157-159, 171, 175M76r, 209^7-21 Ifo, 266b, 
269,274/7, 281,283 
clinical impact of, 4 
consanguinity and, 74 
direct diagnosis of, 272-273 
frequency of, 56 
gene flow and, 58 
genetic drift and, 57-58 
genetic variation of, 53-54 
heterozygote screening for, 269f 
inheritance patterns of, 67 
newborn screening for, 269f 
pleiotropy in, 74 
population screening for, 270b 
variable expression of, 72 
Cystic kidneys, 279b 
Cystine, 149 
Cystinosis, 134f, 149 
Cystinuria, 134r, 149 
Cytochrome c oxidase deficiency, 134f 
Cytochrome P450 gene, 294f, 295-296 
Cytogenetics, 3, 103 
cancer, 128-130, 129r 

technology and nomenclature of, 103-108, 
104/ 

Cytokines, 181-182, 184/ 

helper T lymphocytes and, 184 
Cytokinesis, 22 

Cytomegalovirus, immune evasion by, I9lb 
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Cytosine, 6, 7/-8/ 

methylation of, 39, 40/ 

Cytotoxic T cells, gene therapy and, 287-288 
Cytotoxic T lymphocytes, 180-182, 181/ 185/ 

D 

Daughter cells, 23, 24/ 

Deafness, 38t 

Deformation, definition of, 313 
Degenerate code, 15 
Dejerine-Sottas syndrome, 29 
Deletions 

of base pairs, 28-29 
chromosomal, 120 

fluorescence in situ hybridization and, 105 
dosage mapping using, 167 
mapping, 166-167, 167/ 
microdeletion syndromes, 120-123, 122/ 
123r, 124/7 

physical mapping and, 166-167, 167/ 
submicroscopic, detection of, 170 
22qll deletion syndrome and, 124/ 
Dementia, 92-93 
Dendritic cells, 181 
Dental anomalies, 122 

Dentatorubral-pallidoluysian atrophy, 98t-99t 
Deoxyribonucleic acid (DNA). See DNA 
(deoxyribonucleic acid) 

Derivative chromosomes, 118 
Dermatitis, severe, 152 
Deuteranomaly, 86, 87/ 

Deuteranopia, 86, 87/ 

Development 

animal models for, 197, 198r, 209b-211b, 

210 / 

basic concepts of, 197 
embryonic, major processes in, 198-199 
genetic basis of, 197-216 
genetic mediators of, 199-205 

DNA transcription factors, 203-204 
extracellular matrix proteins, 205 
paracrine signaling molecules, 199-203, 
203/ 

pattern formation, 205-215 
Developmental delay, 135-136, 315f 
Developmental genetics, 197 
Diabetes insipidus, nephrogenic, 88t 
Diabetes mellitus, 2t, 3, 4/ 28, 38t, 139, 239, 
245t, 257-258, 258f, 280, 292, 293b, 314, 
318 

genetics and, 1, 3 

maturity-onset diabetes of the young, 258 
type 1, 134t, 137, 257, 258t 
animal model, 257 
expression profiles in, 298 
HLA alleles associated with, 166 
HLA-DQBl and, 190, 192r 
type 2, 134t, 137, 257-258, 258r, 296b, 298 
HLA in Pima Native Americans, 172 
Diaphragmatic hernia, 112-113, 279b, 282 
Diarrhea, 152 
Dicer, 287, 287/ 

Dichromatic vision, 86 
Dideoxy method, 50, 51/ 

Diethylstilbestrol, 315t 
Differentially methylated regions (DMRs), 95 
Diffuse large B-ceU lymphoma, Burkitt 
lymphoma vs., 297 


Digenic inheritance, 163/7 
DiGeorge sequence, \24b, 194 
DiGeorge syndrome, 123f 
Dilated cardiomyopathy, 249-252 
familial, 253r-254f 
Diploid cells, 6 
Direct diagnosis 

advantages and disadvantages of, 271, 272t 
methods of, 272r 
Direct mutation analysis, 271-275 
Disabilities, 317 
Discordant twins, 244-245 
Disomy, uniparental, 94-95 
Dispermy, 108-109 
Dispersed repetitive DNA, 21, 22/ 

Disruption, definition of, 313 
Dizygotic twins, 244 
DMPK gene, 96-97 

DMRs (differentially methylated regions), 95 
DNA (deoxyribonucleic acid) 
amplification of, 48-50 
coiling of, 7-8, 9/ 
composition and structure of, 6-7 
constituents of, 7 
family of, 21-22 
genes composed of, 6 
history of, 1-3 
replication of, 8-11, 10/ 
cell cycle and, 22 
sequencing of, 50, 51/-52/ 
in mutation detection, 53t 
structure of, 3 
types of, 21-22, 22/ 

DNA mismatch cleavage, 53f 
DNA polymerase, 8 

in polymerase chain reaction, 49, 49/ 

DNA profiles, in forensic setting, 47b, 47f 
DNA repair, 39 

diseases caused by defect in, 37b-38b, 38/ 38t 
DNA repair genes, 225 
BRCAl and BRCA2 genes and, 234/ 
genomic instability, 225 
DNA sequences 
amplification of, 48 
computer analysis of, 168-170 
consensus, 20 
conserved, 168 
PCR for, 50 

Dolichostenomelia, Marfan syndrome and, 75 
Dominant allele, 28 
Dominant disorders 

gain-of-function mutations in, 31 
mutation-selection balance in, 58 
Dominant negative mutation, 32, 32/ 

Donor site, 29-30 
Dorsal/ventral axis, 198-199 
formation of, 208 
Dosage compensation, 79 
Dosage mapping, 167 
Dosage sensitivity, 29 
Double helix, DNA, 6-7, 8/ 

Double-stranded DNA breaks, 39, 234 
Down syndrome. See Trisomy 21 
Downstream DNA sequences, 8 
DPC4 gene, 222t 
Drug, 293 

individualized therapy, 293-296, 294f, 295/ 
predicting serious adverse responses, 293 
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Duchenne muscular dystrophy, 2f, 3, 30, 39, 81, 
84/7, 85/, 175M76f, 266/?, 268, 271-273, 
281,283,286-287 
Bayesian analysis of, 305-306 
creatine kinase and, 84, 267/ 
gene, 285 

gene mapping of, 167 
gene therapy for, 284f 
genes causing, promoters in, 12 
germline mosaicism in, 68 
Duplications 

chromosomal, 126 
dosage mapping using, 167 
of genes, 29 
Dynein, 2llb-2\2b 
Dysarthria, 151 

Dysmorphic syndromes, 301-302 
Dysmorphology. See also Congenital 
malformations 
definition of, 301, 312-314 
isolated and syndromic, 313 
key terms in, 313 
sequence in, 313-314, 314/ 

Dysplasia, definition of, 313 
Dystroglycan-sarcoglycan complex, 84, 85/ 
Dystrophin, 84, 85/ 

E 

Ear malformation, 315f 
EcoRI, 42, 43/ 

Ectoderm, 206 
Ectodermal dysplasia, 313 
Ectopic expression, 197, 198/ 

Edwards syndrome. See Trisomy 18 
E2F transcription factors, 70 
Ehlers-Danlos syndrome, 175r-176r 
Electron transfer flavoprotein (ETF), 148 
Electrophoresis, protein, 41-42, 42/ 

Ellis-van Creveld syndrome, 57 
Embryonic stem cells (ESCs), 289, 318 
of mouse, 209b~2llb, 210/ 

Emphysema, 273b 

Employment, discrimination of, 318 
Encephalocele, 242b-243b, 242/ 279b, 3i7b 
Encephalopathy, 141 
Endocytosis, 250/7-252/?, 251/ 

Endoderm, 206-208 
Endoplasmic reticulum, 7/ 

rough, posttranslational modification in, 19 
Endothelin-3, Hirschsprung disease and, 204b 
Endothelin-B, Hirschsprung disease and, 204b 
End-stage renal disease, 298 
Energy production defects, 134r, 148 
Engrailed (En-l) gene, 213 
Enhancers, 13, 13/ 

Environment, genes and, 244-247 
Environmental considerations, in cancer, 
218-219 

Environmental factors, 239 

congenital malformation and, 247 
in genetic disease, variable expression and, 
69-72 

phenotypes and, 60-63 
relative influence of, 244 
Environmental influences, on genetic diseases, 
3,4/ 

Epidermolysis bullosa, 301 
Epigenetics, 23\b 


Epilepsy, 92-93, 245t, 253r-254f, 262 
Equational division, 23, 26 
Equatorial plane, in mitosis, 23, 24/ 

ERBB oncogene, 224 

ERBB2 oncogene, 225 

Erythroblastosis fetalis, 193 

ETF. See Electron transfer flavoprotein (ETF) 

ETF-ubiquinone oxidoreductase (ETF-QO), 

148 

Ethics, 317-318 

Ethnic groups, heterozygote screening in, 

269t 

Euchromatin, 14 
Eugenics, 318 
Eukaryotes, 15 

intron-exon structure of, 20 
Euploid cell, 108 
Ewing sarcoma, 129f 
Exome, 52 

Exome sequencing, 173-176, 174/ 

Exons, 15, 15/ 20-21 
Expanded repeats, 30-31 
Expressed sequence tags (ESTs), 170-171 
Extracellular matrix, in embryonic 
development, 205 
Eye, development of, 197, 198/ 

F 

Fabry disease, 146f 
Facial hemangioma, 302b 
Faciogenital dysplasia, 88t 
Factor V Leiden allele, 252-254 
Factor VIII, 82, 90 
Failure to thrive, 135-137 
False-negative rate, 267 
False-positive rate, 267 
Familial adenomatous polyposis, 310-311 
Familial adenomatous polyposis gene, 229-233, 
232b, 232/ 237 

Familial amyotrophic lateral sclerosis, 

253r-254r 

Familial breast cancer, 30, 234 
Familial cardiomyopathy, 272t 
Familial dilated cardiomyopathy, 253t-254f 
Familial hypercholesterolemia, 31-32, 249, 
250/7-252/7, 250/ 253t-254f, 263, 310-311 
Familial melanoma, 222t, 235-236 
Familial Parkinson disease, 253r-254f 
Familial polyposis, 30 
Family history, 3\\b 

Family screening, for genetic disorders, 266 
Fanconi anemia, 38t, 194, 234, 318 
Fatty acids, metabolism of, 141, 141/ 

EBNl mutations, 75/7-76/? 

EBN2 mutations, 75/7-76/? 

F508del, 62 
Fertilization, 26 

Fetal alcohol syndrome (FAS), 312f, 314, 3l6b, 
316/ 

Fetal death, 315f 

Fetal DNA, in maternal circulation, 281 
Fetal hemoglobin, 32-33 
Fetal treatment, 282 
a-Fetoprotein (AFP) 
in amniotic fluid, 275 
in maternal serum, 275, 279-280, 280/ 
Fibrillin, 75 
Fibrillin-1, 171-172 


Fibroblast growth factor (FGF), 234 
in gene therapy, 288 

Fibroblast growth factor (FGF) family, 199 
disorders of, 200/7-202/?, 201/ 207-208, 
209/7-211/7,211/ 

in limb development, 213 
First-degree relatives, 63 
5' cap, 12 

Fluconazole, as teratogen, 315t 
Fluorescence in situ hybridization (FISH), 
105-107, 106/ 
in amniocytes, 275 
Folate, neural tube defects and, 3\7b 
EOS gene, 219-220, 219/ 224f 
Founder effect, 57-58 
FOXP3 gene, 192 
Fragile site (FRAXE), 98f-99f 
Fragile X syndrome, 2t, 97-101, 98t-99t, 100/ 
175M76L 209 / 7 - 211 / 7 , 266b, 272t, 301 
Frameshift mutation, 28-29, 30/ 

Friedreich ataxia, 97-99, 98t-99t, I75t-I76t 
Fructose, metabolism of, 136-137, 136/ 
Fructose 1,6-bisphosphatase (FBPase) 
deficiency, 136/ 

Fructose intolerance, hereditary, 134t, 136-137 
Fructosuria, 134t, 136, 136/ 

Functional cloning, 167-168 
Fusion genes, 88 

G 

GO stage, 22-23 
G1 (gap 1) phase, 22, 22/ 

G2 (gap 2) phase, 22, 22/ 

Gain-of-function mutations, 31, 32/ 
Galactose, metabolism of, disorder of, 

133-136 

Galactosemia, 133, 134f, 135, 136/ 139, 
175M76L 266b, 268, 278t, 296b 
consanguinity and, 74, 76 
newborn screening for, 269r 
Gametes, 6 

Gametogenesis, meiosis and, 26 
Gangliosidosis, 278t 
Gastrointestinal atresias, 279b 
Gastroschisis, 275-276, 279b 
Gastrulation, 206, 207/ 

Gaucher disease, 145-147, 146f, 209/7-211/? 

genetic drift and, 57-58 
Gene, family of, for opsins, 88 
Gene expression 

cancer genomics and, 297, 297/ 
in common disease, 296-298 
DNA microarrays and, 53 
regulation of, 12-15, 13/ 
testing for, 170-171, 170/-171/ 

Gene flow, 58 
Gene frequency, 55 

Gene mapping, 154-166. See also Linkage 
analysis 

Gene splicing, 15, 15/ 

Gene therapy, 265-291 
gene blocking in, 285-287 
germline, 288 

for noninherited diseases, 287-288 
perspective in, 288 
somatic cell, 282, 284f 

gene replacement in, 282-285 
nonviral vectors for, 285 
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for severe combined immunodeficiency, 
2S4b, 2Mt 
viral vectors for, 285 
TP53 gene in, 229, 287-288 
General transcription factors, 13, 13/ 
Generalized epilepsy, 253f-254f 
Generalized epilepsy with febrile seizures plus 
type 3, 253t-254t 
Genes 

as basic unit of inheritance, 6 
disease causing 

factors affecting expression of, 67-74 
identification of, 154 
DNA and, 6 

and environment, 244-247 
history of genetics and, 1 
intron-exon structure of, 20 
number of, in humans, 3, 6 
structure of, 20-22, 21/-22/ 

Genetic advocacy groups, 309 
Genetic code, 15, 16f 
Genetic counseling, 301-320 

accurate diagnosis and, 301-304, 3Q2b 
common indications for referral, 512b 
definition of, 304, 307 
effectiveness of, 308 
negative family history and, 303^7 
nondirectiveness in, 305-307 
recurrence risks and, 304-305, 5Q5b-5Q6h, 
305/-306/, 306t, 307 
support groups in, 309 
tasks of, 304, 307 

Genetic counselors, 308-309, 51Qb 
Genetic diseases 

childhood deaths attributable to, 4f 
clinical impact of, 3-4 
environmental influences on, 3, 4/ 
history of research on, 1-3 
and pediatric hospital admissions, 4 
prevalence of, 2t, 4f 
types of, 3 
Genetic drift, 57 

linkage disequilibrium and, 164 
Genetic engineering, Hb-iSb 
Genetic enhancement, 288b-2S9b 
Genetic Information Nondiscrimination Act 
(GINA), 318 

Genetic lethal mutation, 58 
Genetic screening 
definition of, 265 
molecular tools for, 271-275 
prenatal, 275, 279 
as presymptomatic diagnosis, 270 
principles of, 265-268 
psychosocial implications of, 270-271 
types of, 265, 266b 

Genetic testing, 265-291. See also Genetic 
screening; Prenatal diagnosis 
of children, 318 
definition of, 265 
direct diagnosis in 

advantages and disadvantages of, 271, 272t 
methods of, 272t 
direct-to-consumer, 274:b 
ethical concerns in, 317-318 
indirect diagnosis in, 271 
limitations of, 274^ 
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Genetic variation 
causes of, 56-58 
detection and measurement of, 

40-53 

in blood groups, 40-41 
at DNA level, 42-53 
with protein electrophoresis, 41-42 
origin of, 28-59 
in populations, 53-58 
Genetics, historical background of, 1 
Genome 

consumer services providing, 295b 

definition of, 3 

structure of, 20-22, 21/-22/ 

Genome scan, 248^ 

Genome-wide association studies, 172-173, 
173/260 

Genomic imprinting, 93-101, 94/ 

Genomic instability, 225 
Genomic library, 173, 174^7, 174/ 

Genomics, impact of, 293-299 
Genotype, 28, 60-63 
Genotype frequency, 55 
Genotype-phenotype correlations, 69-72 
Germline mosaicism, 68, 68/ 

Germline mutations, 28 
radiation and, 35, 57b 
Germline therapy, 288 
Giemsa banding (G-banding), 104, 106/ 

Gli transcription factors, 200/ 

GLI3 gene, 213 
Gliomas, 217 

Global epigenomic alterations, 231/ 

Globin chains, 32 

protein electrophoresis and, 41 
Globin disorders, 92 
a-globin genes, 34 
p-globin genes, 34 
Glucocerebrosidase, 145-147 
Glucocorticoid-remediable aldosteronism, 
253t-254t 

Glucose, metabolism of, 136/ 137 
Glucose-6-phosphate dehydrogenase 
(G6PD) 

deficiency of, primaquine and, 294 
X chromosome mosaicism and, 80 
p-glucosidase, 145-147 
Glycogen, metabolism of, 136/ 137-138 
Glycogen storage disease, type 2, 278t 
Glycogen storage disorders, 137-138, 138t 
Gmi gangliosidosis (infantile), 146f 
Goiter, 315r 
Golgi apparatus, 7/ 

Gordon syndrome, 253f-254t, 255 
Gorlin syndrome, 199, 200/ 

Graves disease, 192f 
Grebe chondrodysplasia, 199-202 
Greig cephalopolysyndactyly, 213 
Growth factor genes, 224f 
Growth factor receptors, 219/ 

Growth factors, 219-220 
Growth hormone deficiency, familial isolated, 
67 

Growth retardation, 152 
Guanine, 6, 7/-8/ 

Guanosine diphosphate (GDP), 225 
Guanosine triphosphate (GTP), 225 
Gynecomastia, 117/ 


H 

Hand-foot-genital syndrome, 214 
Haploid cells, 6 
Haploinsufficiency, 31-32 
Haplotype, 154 

Hardy-Weinberg principle, 55-56, 56/ 

Haw River syndrome, 98t-99t 
Hearing loss, 315t 
Heart defects, 315t 
congenital, 2t 

Heart disease, 2f, 3, 4/ 28, 239, 243-244, 

247-252, 250/7-252/7, 253f-254f, 262-263, 
296/7 

cancer and, 255-256 
congenital, 241, 279b 
coronary, 249t 
form of, 249 
genetics and, 1, 3 
ischemic, gene therapy for, 289 
major cause of, 257 
obesity and, 258 
risk factor for, 254 
Heart malformations, 38f 
Heavy chains, of immunoglobulin chain, 
185-186, 186/ 
genes for, 187/ 

and light chains, multiple combinations of, 
188 

Heavy metal transport defects, 134f 
Heavy metals, 149 
Hedgehog family, 199, 200/ 

Helix-loop-helix motif, 14/ 14f 
Helix-turn-helix motif, 14t 
HELLP syndrome, 141 
Helper T cell, HIV infection of, I9lb 
Helper T lymphocytes, 180-181, 181/ 
Hemarthrosis, 83/ 

Heme, 32 
Hemizygosity, 88 

Hemochromatosis, 2t, I54t, 175t-176t, 274b 
hereditary, 150/7, 150/ 

HLA allele associated with, 165-166, 191, 

192t 

Hemoglobin, genes for, 32, 33/ 

Hemoglobin disorders, 33t. See also Sickle cell 
disease; a-Thalassemia; p-Thalassemia 
mutation and, 32-35 
newborn screening for, 268 
Hemoglobin H (HbH) disease, 55t 
Hemoglobinopathies, 266b 
Hemolytic anemia, 191b 
Hemolytic disease of the newborn, 193 
Hemophilia, 3, 310/7, 311 
Hemophilia A, 2t, 4/ 30, 39, 81, 82/7-83/7, 
82/-83/, 175M76t, 305/7-306/7 
Bayesian analysis of, 306 
female homozygotes and, 88 
germline mosaicism in, 68 
Hemophilia B, 30, 83, 175M76r, 283, 284f, 288 
Hepatic insufficiency, 136-137 
Hepatitis C, 287 
Hepatoblastoma, 95, 502b 
Hepatomegaly, 137-138, 147 
Hereditary acrodermatitis enteropathica, 149 
Hereditary fructose intolerance (HFI), 136-137, 
136/ 

Hereditary neuropathy, with liability to pressure 
palsies, 29 
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Hereditary nonpolyposis colon cancer genes, 

233 

Hereditary nonpolyposis colorectal cancer 
(HNPCC), 2t, m, 175M76f, 256, 270, 
310-311 

Hereditary persistence of fetal hemoglobin 
(HPFH), 32-33 

Hereditary tyrosinemia type 1 (HTl), 138-140 

Heritability of multifactorial traits, 245-246 

Hers disease, 138t 

Heterochromatin, 14 

Heterodisomy, 125-126 

Heteromorphisms, 166-167 

Heteroplasmy, 92 

Heterotetramers, 33-34 

Heterozygosity, loss of, 226, 227/ 

Heterozygote, 28 
compound, 35, 36/ 
manifesting, 90 

Heterozygote screening, 266, 268-269, 269t 
for cystic fibrosis, 269f 
in ethnic groups, 269f 
public policy guidelines for, 269b 
High-resolution banding, 104-105 
High-throughput DNA sequencing, 50-52, 52/ 
Hirschsprung disease, 204, 204/?, 205/ 236, 247 
Histone, 7-8, 9/ 
acetylation of, 14 

deacetylation of, on inactive X chromosome, 
81 

hypoacetylation of, in imprinting, 93 
Holandric genes, 91 
Holoprosencephaly, 206, 279b 
Holt-Oram syndrome, 14f, 214, 214/ 
Homeodomain, 208 

Homeotic gene complex (HOM-C), 208, 209/ 
Homocystinuria, 134f, 266b, 268 
Homologous chromosomes, 6 
meiosis and, 23, 26/ 

Homologs, 6 
Homotetramers, 33-34 
Homozygote, 28, 35, 36/ 

compound heterozygotes as, 35 
Housekeeping genes, 12-13, 91 
Hox genes, 208, 209/ 214 
Human artificial chromosomes, 4:4:b-45b, 285 
Human gene map, 157-161, 160/ 

Human Genome Project, 3, 169^?, 248^?, 317 
Human immunodeficiency virus (HIV), 
193-194, 283 
CCR5 and, 191^7, 288 
gene therapy aimed at, 288 
helper T-cell infection by, 191^7 
hemophilia patients and, 82 
HLA genes and, 190 

Human immunodeficiency virus-1 (HIV-1), 298 
Human leukocyte antigen (HLA) loci, 188-190, 
192f 

ankylosing spondylitis and, 166, 166r 
diabetes type I and, 166 
hemochromatosis and, 165-166 
Human papilloma virus, and cervical cancer, 

229 

Humoral immune system, 180-182, 183/ 
comparison with, 184-185 
Hunter syndrome, 145, 145t 
Huntingtin, 71 
Huntington chorea, 69 


Huntington disease, 2f, 7\b, 71/ 97-99, 98f-99f, 
157-159, 175M76f, 209b-2llb, 266b, 270, 
274b, 281, 285, 287, 310^7, 318 
age of onset for, 69, 69/ 
inheritance of, 1-3 
Huntington disease-like 2, 98r-99f 
Hurler-Scheie syndrome, 145, 145t 
Hurler syndrome, 145, 146/ 

Hydrocephalus, 211^7-212^7,279^7, 282, 315f, 317/? 

Hydronephrosis, 279b 

Hydrops, 279, 279^7 

Hydrops fetalis, 33t, 34 

Hyper IgE syndrome, 195r 

Hyperammonemia, 132, 135 

Hypercalcemia, 122 

Hypercholesterolemia, familial, 2t, 31-32, 

175M76r, 250b-252b, 250/ 253t-254t, 263 
Hyperglycemia, 137 
Hyperlipidemia, 140 

Hypermethylation, of cancer-related gene, 230 
Hypermutation, somatic, 182, 187-188 
Hyperphenylalaninemia, 138, 266b 
Hypertension, 149, 247-249, 253f-254f, 254-255, 
292, 293b, 296b, 298 
genetics and, 3 

neurofibromatosis and, 72-73 
Hypertrophic cardiomyopathy, 249 
familial, 253f-254r 

Hypochondroplasia, 200^7-202^7, 201/ 
Hypoglycemia, 135, 137-138 
Hypohydrotic ectodermal hypoplasia, 88f 
Hypolactasia, 134t, 137 

Hypomethylation, of cancer-related genes, 225 
Hypoparathyroidism, 151 
Hypophosphatemic rickets, 90 
Hypothyroidism, 112, 112^7, 266b, 268, 315t 
congenital, newborn screening for, 269t 
Hypotonia, 268-269 

I 

Iduronidase, deficiency of, 145, 146/ 

Immune response, 180-185 

adaptive immune system, 180-181, 181/ 
cellular, 180-184, 185/ 
humoral, 180-182, 183/ 
genes for, 192f 

innate immune system in, 180 
comparison with, 184-185 
pathogen evasion of, 19117 
proteins of, 185-188 
Immune system, dysfunction of, 152 
Immunodeficiency diseases, 193-194 
primary, 193-194, 195r 
secondary, 193-194 
Immunogenetics, 180-196 
Immunoglobulins, 181-182, 184/ 
classes of, 185-186 
diversity of, 186-188 
genes for, 192f 

molecules and genes of, 185-186, 186/-187/ 
Imprinting, of genes, 79, 93-101 
In utero intervention, 282 
In vitro fertilization, preimplantation genetic 
diagnosis and, 281 

Inborn errors of metabolism, 1, 265, 266^ 
fetal treatment of, 282 
newborn screening for, 268 
prenatal diagnosis of, 278, 278t 


Incest, 77 

Incontinentia pigmenti type 1, 90 
Independence principle, 55 
Independent assortment, 154, 155/ 
principle of, 60 
Index case, 63 

Indirect diagnosis, 271, 272t 
Induced mutations, 35 
Induction, in embryo, 198-199, 206 
Infantile Refsum disease, 144 
Infections, 113, 149 
Influenza, 4/ 

Informative mating, 159-160, 161/ 

Inherited breast cancer, 233-234 
Inherited cancer-causing genes, identification 
of, 226-237 

Inherited cardiomyopathy, 275 
Insertions, of base pairs, 28-29, 30/ 

Inserts, in plasmids, 44^-45^7, 44/ 

Insulin, pancreatic development and, 215 
Insulin gene, 257 

Insulin-like growth factor 2 (IGF2), Beckwith- 
Wiedemann syndrome and, 95 
Insulin promoter factor 1 (IPFl), 215 
Insulin receptor gene, mutations in, 137 
Insulin resistance, 96 

Insurance providers, discrimination by, 318 
Integrins, 205 
Intellectual disability, 122 
Interphase, 22, 22/ 

Interphase I, 23 
Interphase II, 26 
Interstitial deletion, 120 
Intracranial hemorrhage, 315f 
Intrauterine growth retardation, 279, 279^ 
Introns, 15, 15/ 20-21 
function of, 20 
single-copy DNA and, 21 
unrelated genes contained in, 21 
Inversions, chromosomal, 126-127, 127/ 
Ionizing radiation, 35 

mutation rates due to, 37^ 

IPEX (immunodysregulation, 

polyendocrinopathy, enteropathy, 

X linked), 192 
IRAK-4 deficiency, 195t 
Ischemic heart disease, gene therapy for, 289 
Isochromosomes, 127, 128/ 

Isodisomy, 125-126 

Isolated growth hormone deficiency (IGHD), 

67 

Isolated malformations, 313 
Isoniazid, 294, 294f 
Isotretinoin, as teratogen, 315r 
Isotypes, of immunoglobulins, 185-186 

J 

Jackson-Weiss syndrome, 202r 
Jak3, 194 

Jervell-Lange-Nielsen syndrome, 252 
Joint probability, 305 
/[/ATgene, 219-220, 219/ 

Junctional diversity, 187 

Junctional epidermolysis bullosa (JEB), 205 

Juvenile amyotrophic lateral sclerosis, 253r-254r 

Juvenile Huntington disease, 71 

Juvenile myoclonic epilepsy, 97-99 

Juvenile retinoschisis, 88t 
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K 

Karyogram, 103 
Karyotype, 103, 104/ 

banding pattern of, 104, 106/ 
nomenclature for, 105f 
spectral, 105-107, 107/ 

Kayser-Fleischer ring, 151 
Kearns-Sayre disease, 93 
Kearns-Sayre syndrome, 2t 
Ketonuria, 135 
Kidney cancer, 95 
Kidney damage, 151 

Killer cell immunoglobulin-like receptors (KIR), 
190 

Killer T lymphocytes, 182, 185/ 

Kilobases (kb), 7-8 

Klinefelter syndrome, 2t, 116-117, 117/ 

Barr body and, 81 
Klippel-Trenaunay syndrome, 312f 
Knockout mouse, 208, 209^7-211/;, 210/ 

Krabbe disease, 146t 

KRAS gene, in colon cancer, 230 

L 

Lactase nonpersistence, 137 
Lactase-phlorizin hydrolase, 137 
Lactose, 137 
Lactose intolerance, 137 
LAMC2 gene, 205 
Laminins, 205 

Langer-Giedion syndrome, 123f 
Laterality, of complex disorders, 262-263 
Leber congenital amaurosis, 163^?, 284f 
Leber hereditary optic neuropathy (LHON), 2t, 
92 

Left/right axis, disorders of, 211^7-212^?, 212/ 
Left-right organizer (LRO), 211b-212b 
Lentiviral viruses, 283 
Leptin hormone, 259 
Leri-Weill dyschondrosis, 88t 
Lesch-Nyhan syndrome, 27St 
Lethal mutation, genetic, 58 
Leucine zipper, 14t 
Leukemia-like disease, 284b 
Leukemias, 37, 38t, 217, 297 
cytogenetic changes in, 129t 
Down syndrome and, 110 
Leukocyte adhesion deficiency, 195f 
Leukodystrophy, metachromatic, 146f 
Liability distribution, 239-240, 240/ 

Liddle syndrome, 253f-254f, 255 
Li-Fraumeni syndrome, 175r-176r, 222r, 223, 
229,237 

Light chains, of immunoglobulin chain, 186/ 
genes for, 187/ 

and heavy chains, multiple combinations of, 

188 

Likelihood ratio, 157 

Limb, development of, 213-215 

Limb bud, formation of, 213, 214/ 

Limb defects, 315f 
Limb deficiencies, 278, 279b, 3l5t 
LINE (long interspersed elements), 21-22 
Linkage, vs. association, 164-166 
Linkage analysis, 154-157, 155/~156/, 248^ 
candidate genes found by, 168, 168/ 
for genetic diagnosis, 271, 271/ 
in genome-wide association studies, 172 


Linkage analysis (Continued) 

human gene map and, 157-161, 160/ 
of inherited cancer-causing genes, 226 
LOD scores in, 157, I58b-I59b, 158/-159/ 
close to zero, 161 
multifactorial disease, 248^ 
multipoint mapping in, 158^-159^ 

Linkage disequilibrium, 161-164, 164/ 
as form of association, 163-164, 166 
in genome-wide association studies, 173, 173/ 
MHC-disease associations and, 191 
Linkage equilibrium, 163-164 
Linkage phase, 156 

polymorphic markers and, 160-161, 165/ 
Linked loci, 154, 155/ 

Lipid disorders, 134t 

Lipid storage diseases, 145-147, 146f 

Lipoprotein genes, 249f 

Lipoprotein lipase deficiency, 284f 

Liposome, 285 

Lisch nodules, 72 

Liver cancer, 95 

Liver cirrhosis, 273b 

Locus (loci), 28 

genetic distance between, 155 
syntenic, 155 
Locus control region, 35 
Locus heterogeneity, 72-74, 74/ 

LOD score of zero and, 161 
of retinitis pigmentosa, I63b 
vs. multifactorial inheritance, 243-244 
LOD scores, 157, I58b-I59b, 159/ 
close to zero, 161 

Long arm (q), of chromosome, 103 
Long interspersed elements (LINEs), 21-22 
Long QT syndrome, 175r-176f, 253f-254f, 272f, 
273-274, 293-294 

Long-chain acyl-CoA fatty acids, 141 
Long-chain L-3-hydroxyacyl-CoA 

dehydrogenase (LCHAD) deficiency, 

141 

Losartan, 76 

Loss of heterozygosity, 226, 227/ 
Loss-of-function mutations, 31-32, 32/ 

Lou Gehrig’s disease, 253t-254f 
Low birth weight, 315t 
Low-copy repeats, 122-123 
Low-density lipoprotein (LDL) receptors, 249, 
250b-252b, 251/ 

Lower limb paralysis, 317^? 

Lung cancer, 218, 229, 244 

expression profiles in, 297, 297/ 

Lung tumors, 287-288 
Lymphomas, 217, 297 
T-cell, 23lb 
Lynch syndrome, 233 
Lyon hypothesis, 79 

Lysosomal storage disorders, 144-147, 146/, 

146t 

M 

Machado-Joseph disease, 98t-99t 
Macrophages, 181 
infection of, 191^ 

Macular degeneration, 274b, 287 
Magnetic resonance imaging (MRI), prenatal, 
279 

Major gene, 243 


Major histocompatibility complex (MHC), 
188-193, 188/-189/ 
class I, 182, 185/ 188-190, 189/ 
class II, 181-182, 183/-184/, 188-190 
class III, 188-190 

and disease associations, 190-192, 192f 
genes for, 192t 
T-cell receptors and, 190 
Malaria, 298 

HLA allele associated with, 192r 
sickle cell heterozygotes and, 56-57, 57/ 

Male infertility, 315t 
Male-pattern baldness, 91 
Malformation, definition of, 313 
Malignant peripheral nerve sheath tumors, NFl 
and, 72 

Malignant tumor, 232/ 

Manifesting heterozygotes, 90 
Maple syrup urine disease, 134f, 139-140, 266h, 
278t 

Marfan syndrome, 2t, 39-40, 74, 75b-76b, 75/ 
122, 157-159, 171-172, 175M76r, 205, 
274^7, 285, 303/7, 309-310, 312-313, 312t 
Markers, polymorphic, 156, 159, 160/ 
on chromosome 9, 161/ 
direct observation of families and, 161, 162/ 
in genome-wide association studies, 173/ 
in Human Genome project, \69b 
locus heterogeneity and, 161 
positional cloning and, 168, 168/ 
usefulness of, 159-160 
Maroteaux-Lamy syndrome, 145, 145f 
Mass spectrometry 

for mutation detection, 53f 
tandem, in newborn screening, 268 
Maternal serum screening, 275, 279-280, 280/ 
Mature transcript, 15, 15f 
Maturity-onset diabetes of the young (MODY), 
134f, 137, 258 

Maximum likelihood estimate, 158/7-159/7 
McArdle disease, 138t 
MDM2 protein, 235 
Measles, 3, 4/ 245f 
Meckel-Gruber syndrome, 313-314 
MECP2 gene, 90 
Medial/distal axis, 198-199 
Medical genetics. See also Clinical genetics 
application of principles of, 302-303 
bioethics and, 317-318 
definition of, 1 
history of, 1-3 

importance to health-care practitioner, 1 
as mainstream specialty, 301 
Medium-chain acyl-CoA dehydrogenase 

(MCAD) deficiency, 134f, 135, 139, 141 
Meiosis, 6, 23-26 

gametogenesis and, 26 
in male, X and Y crossover in, 116, 116/ 
nondisjunction in, 109, 109/ 
spermatogenesis/oogenesis and, 25/ 
stages of, 25/ 

Meiosis I, 23, 25 

nondisjunction in, 109, 109/ 

Meiosis II, nondisjunction in, 109, 109/ 

Meiotic failure, 108-109 
Melanoma, 224t, 226, 235-236 
familial, 175M76f, 222f, 235-236 
gene therapy for, 287-288 
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Melanoma (Continued) 
malignant, 230 
sporadic, 236 

MELAS (mitochondrial myopathy, 

encephalopathy, lactic acidosis, and stroke), 
92-93, 252-254 
Memory B cells, 182 
Memory T cells, 184 
Mendel, Gregor, 1, 3/, 6, 28, 154 
contributions of, 60 
Mendelian conditions, 3 
Mendelian Inheritance in Man (McKusick), 3, 

60 

Mendelian traits, 60 
Meningioma, 129f 
Meningomyelocele, 279/ 

Menkes disease (MND), 134f, 149-151 
Mental retardation, 38f 
MERRF (myoclonic epilepsy with ragged red 
fiber syndrome), 92-93 
Mesenchyme, 206-207 
Mesoderm, 206-208 
Messenger RNA (mRNA), 11/-12/ 
splicing of, 15 
synthesis of, 12 

testing for gene expression and, 170-171, 
170/-171/ 

translation of, 15-16, 17/ 

Metabolic defects, inheritance of, 133 
Metabolic disorders, 132-153, 134f 
of amino acid metabolism, 138-140 
of carbohydrate metabolism, 133-138 
classification of, 133 
of degradative pathways, 144, 145f 
diagnosis of, 132, 135^; 
dietary management of, 139^7 
of energy production defects, 134f, 148 
of lipid metabolism, 140-142 
prevalence of, 132-133, 134r 
of steroid hormones, 142-143, 143/ 
of transport systems, 149-152 
of urea cycle, 147-148 
Metabolic pathways, 133 
Metabolic processes 
defects of, 133-152 
types of, 133 

Metabolism, variants of, 132-133 
Metacentric chromosome, 103, 104/ 
Metachromatic leukodystrophy, 146f 
Metaphase, 23, 24/ 

Metaphase 1, 24, 25/ 

Metaphase II, 25/ 26 
Metastasis, 230/ 

Methimazole, as teratogen, 315r 
Methotrexate, as teratogen, 315f 
Methylation 

of imprinted alleles, 93 
in monozygotic twins, 246 
Methylmalonic acidemia, 134f, 278t 
MHC. See Major histocompatibility complex 
(MHC) 

MHC restriction, 190 
Microarrays, DNA, 53, 54/ 
in cancer, 297/ 
in common disease, 296-297 
in comparative genomic hybridization, 107 
in gene expression testing, 171 
in genetic diagnosis, 273 


Microarrays, DNA (Continued) 

in genome-wide association studies, 172-173, 
172/ 

in mutation detection, 53t 
Microcephaly, 316^ 

Microdeletion syndrome, 120-123, 122/ 
of chromosome 22, 124i» 
types of, 123t 
Williams syndrome as, 122 
p 2 -Microglobulin, 189/ 

Micrognathia, 315f 
MicroRNAs (miRNA), 14-15 
Microsatellite instability, colorectal carcinomas 
and, 233 

Microsatellite (STR) polymorphisms, 227/ 
Microsatellites, 21 

polymorphism of, 45-46, 48/ 

Microtia, 315f 

Miller-Dieker syndrome, 123, 123t 
Mineralocorticoid excess, syndrome of 
apparent, 253f-254f 
Minisatellites, 21 

polymorphism of, 45-46, 48/ 

Mismatch cleavage, 53f 
Mismatch repair, 225 
DNA, 39 

hereditary nonpolyposis colon cancer genes 
and, 233 

Missense mutations, 28-29, 30/ 

Mitochondrial diseases, 3, 79, 92, 93/ 
prevalence of some, 2t 
Mitochondrial DNA (mtDNA), 92, 92/ 
codons of, 15 

Mitochondrial encephalopathy, lactic acidosis 
and stroke-like episodes (MELAS), 2t, 
92-93 

Mitochondrial inheritance, 92-93 
Mitochondrial myopathy, encephalopathy, lactic 
acidosis, and stroke (MELAS), 252-254 
Mitochondrial trifunctional protein (TFP), 141 
Mitosis, 6 

cell cycle and, 22/ 23 
chromosomes stability during, 233 
stages of, 24/ 

Mitral regurgitation, Marfan syndrome and, 

75 

Mitral valve prolapse, Marfan syndrome and, 

75 

MLHl gene, 222t 

MLPA. See Multiplex ligation-dependent probe 
amplification (MLPA) 

Mobile elements, 30 
Modifier loci, 69-72 

MODY. See Maturity-onset diabetes of the 
young (MODY) 

Molecular genetics, 3 
Monochromacy, 86 
Monoclonal tumor, 217 
Monogenic traits, 60 
Mononucleosis, \9\b 
Monosomy 

autosomal, 109, 109/ 

of X chromosome, 114-115, 115/ 

Monosomy 14, 120, 121/ 

Monosomy 21, 120, 121/ 

Monozygotic twins, 244, 244/ 

“Morning sickness,” 316b 
Morquio syndrome, 145f 


Mosaicism 

confined placental, 277-278 
in Turner syndrome, 115 
fetal, 276, 278 
germhne, 68, 68/ 

NFl mutation causing, 73 
tissue-specific, 112 
in trisomy 21, 110 
MSH2 gene, 222t 

Mucopolysaccharidoses, 144-145, 145r, 278r 
Muenke syndrome, 202f 
Multifactorial diseases, 239-264 

congenital malformations of, 247, 247f 
influence of genes on, 246, 248^ 
recurrence risks for, 241, 24If 
threshold model of, 239-241 
Multifactorial disorders, 3, 248^ 
in adult population, 247-262 
alcoholism, 260-261 
Alzheimer disease, 248^7, 253f-254f, 
259-260, 260/ 262-263 
cancer, 255-256 

cardiovascular disorders, 247-255 
diabetes mellitus, 245f, 257-258, 258f 
obesity, 247-249, 254-255, 258-259, 258f 
psychiatric disorders, 261-262, 26If 
childhood deaths attributable to, 4f 
conclusions, 262-263 
congenital malformations, 247 
prevalence of some, 2f 
Multifactorial inheritance, vs. single-gene 
inheritance, 243-244, 244/ 

Multifactorial model, 239, 240/ 

Multifactorial traits, 239 
heritability of, 245-246 
normal distribution of, 239 
Multi-hit concept of carcinogenesis, 220 
Multiple carboxylase deficiency, 278f 
Multiple endocrine neoplasia, 224f, 236-237 
Multiple endocrine neoplasia type 2 (MEN2), 
270 

Multiple epiphyseal dysplasia, 98f-99f 
Multiple germline immunoglobulin genes, 187 
Multiple peripheral pulmonary arterial stenoses, 
122 

Multiple sclerosis, 192f, 245f 
Multiple sulfatase deficiency, 146f 
Multiplex ligation-dependent probe 
amplification (MLPA), 53f 
Multiplication rule, 55 
Multipoint mapping, 158^7-159^7 
Muscular dystrophy, 84, 273-274 
Becker, 84 

Duchenne. See Duchenne muscular 
dystrophy 

myotonic, 96, 96/-97/ 98f-99f 
oculopharyngeal, 98f-99f 
Mutagens, 35 
Mutation 

in cancer, 233 
causes of, 35-39 
definition of, 28 
detection of 

at DNA level, 52-53, 53f 
methods of, 53f 
genetic diseases and, 6 
germline, 28 

in hemoglobin disorders, 32-35, 33f 





Mutation (Continued) 
induced, 35 

molecular consequences of, 31-32 
new, 67-68, 302-303 
protein product resulting from, 31, 32/ 
sex-linked, 79 
somatic, 28 
spontaneous, 35 
types of, 28-31 
vs. polymorphism, 28 
Mutation hot spots, 39 
Mutation rates, 39-40 

of mitochondrial DNA, 92 
radiation and, 37b 
Mutation-selection balance, 58 
Myasthenia gravis, 192f 
M7C gene, 219-220,219/ 

Myelin, 29 

Myelomeningocele, 282 
Myocardial infarction, 245f 
Myoclonic epilepsy with ragged red fiber 
syndrome (MERRF), 2t, 92-93 
Myopathy, 92-93 

Myopia, Marfan syndrome and, 75 
Myotonic dystrophy, 2t, 96-99, 96/-97/, 
175M76t, 209 / 7 - 211 / 7 , 272f, 281 

N 

N-acetylglutamate synthetase, 147/ 
NADH-CoQ reductase deficiency, 134t 
Narcolepsy, 190, 192f 
Nasal hypoplasia, 315f 
Natural killer cells, 180, 190 
evasion of, 190 
Natural selection, 56 

linkage disequilibrium and, 164 
Nature vs. nurture, 244-247 
Nausea, 316/7 
Neanderthal DNA, 50 
Negative family history, 303b 
Negative predictive value, 267, 267f 
Neonatal adrenoleukodystrophy, 144 
Neonatal screening, for phenylketonuria, 266b 
Neoplasm, 217 

Nephrogenic diabetes insipidus, 88f 
Neural crest cells 

in craniofacial development, 208-213 
development of, defects in, 204/7, 205/ 
neurulation and, 204/7 
Neural tube, 206 

Neural tube defects {NTDs}, 2f, 242/7-243/7, 
242/ 247, 247f, 248/7, 277-278, 277/?, 280, 
315r, 316/7, 317 
a-fetoprotein and 
in amniotic fluid, 275 
in maternal serum, 275, 279-280, 280/ 
folate and, 317b 
North America, 241 
Neuroblastoma, 129t 
Neurofibromas, 72, 303b 
Neurofibromatosis, I75t-176t, 302, 310b 
Neurofibromatosis type 1 (NFl), 2t, 30, 39, 72, 
73/ 156-159, 209 / 7 - 211 / 7 , 274 / 7 , 311, 312t 
clinical management of, 73 
diagnosis of, 301 
germline mosaicism in, 68 
introns of, 21 

with negative family history, 303b, 303/ 
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Neurofibromatosis type 1 (NFl) (Continued) 
RAS-MAPK mutations in, 203/ 
variable expression of, 72b-73b 
Neurofibromatosis type 2 (NF2}, 73 
Neurofibromin, 72-73, 228 
Neurulation, 206 
New mutation, 67-68, 302-303 
Newborn screening, 266, 268, 269f 
expanded, 133, 268 
for phenylketonuria, 269t 
tandem mass spectrometry in, 268 
NFl gene, 158/7-159/7, 222t, 228, 228/ 
evidence for mapping, 228 
linkage analysis of, 156, 157/ 
mutations in, 73 

translocations in mapping of, 167 
NF2 gene, 73, 222t 
Niemann-Pick disease, 54/ 146f 
Nitrogenous bases, types of, 6 
Nodal, 208 
Node, 208 

Noggin gene, 202-203, 208 
Noggin protein, 206 
Nonalcoholic liver cirrhosis, 273b 
Nondirectiveness, in genetic counseling, 
305-307 

Nondisjunction, 109, 109/ 
maternal age and, 1 14 
Nonionizing radiation, mutations and, 35 
Nonmendelian inheritance, 79 
Nonobese diabetic (NOD) mouse, 257 
Nonsense mutations, 28-29, 30/ 35 
Nontraditional modes of inheritance, 79-102 
Noonan syndrome, 202-203, 203/ 312t 
Norprogesterones, as teratogens, 315f 
Northern blotting, 170-171, 170/ 

Notochord, 206-207 
NRAS gene, 224f, 235-236 
Nucleolar organizing region (NOR) stains, 104 
Nucleosome, 7-8, 9/ 

Nucleotide excision repair (NER), 37/7-38/7, 39 
Nucleotides, 7 

in DNA replication, 8, 10/ 

Nucleus, of cell, 6, 7/ 

Nystagmus, 65-66 

o 

Obesity, 247-249, 254-255, 258-259, 258f, 292, 
293 / 7 , 296/7 
type 2 diabetes, 258 
Obligate carriers, 68-69 
Occipital encephalocele, 206 
Occipital horn syndrome, 149 
Ocular albinism, X-linked, 79-80, 80/ 
Oculocutaneous albinism, 65-66, 65/ 134t 
Oculocutaneous tyrosinemia, 140 
Oculopharyngeal muscular dystrophy, 98t-99t 
Oligohydramnios, 279b, 315t 
Oligohydramnios sequence, 313-314, 314/ 
Oligonucleotide primers, 48-49 
Omphalocele, 112-113, 275-276, 279b, 302b 
Oncogenes, 223-225 

compared to tumor suppressor genes, 224r 
examples of, 224t 
identification of, 225 
lp36 deletion syndrome, 123t, 125/ 
lq36 deletion syndrome, 125/ 

Oocytes, 25/ 26 


Oogenesis, 25/ 26 
Oogonia, 23 

Ophthalmoplegia, chronic progressive external, 
93 

Optic atrophy, 3St 
Optic glioma, 3Q3b 
Optic nerve, benign tumors of, 72 
Optic pathway gliomas, 72 
Oral contraceptives, venous thrombosis and, 
294f 

Organic acid disorders, 134f 
Organogenesis, 198-199, 208, 215 
Ornithine transcarbamylase, 147/ 

Ornithine transcarbamylase deficiency, 134f, 

148, 288 

Orofacial clefts, 315t 

Osteogenesis imperfecta, 2t, 18, 1 8 / 7 -19/7, 

19/-20/, 279 / 7 , 312f 
germline mosaicism in, 68 
locus heterogeneity and, 72-74 
pleiotropy in, 74 
subtypes of, 18f 

Osteogenesis imperfecta type 1, 161 
Osteoporosis, 3St 
Osteosarcomas, 70 

Otopalatodigital spectrum disorders, 88f 
Ovarian cancer, 230, 270, 272t, 273-274, 311b 
Ovum, mature, 26 

P 

p arm of chromosome. See Short arm (p), of 
chromosome 
p53 protein, 229 
colon tumors, 230 
melanomas and, 235 
P63 protein, 215 
Palindromes, 44/7-45/7 
Pallister-Hall syndrome, 213 
Pancreas 

cancer of, 273-274 
cystic fibrosis of, 61, 61/ 

Panmixia, 56 

Papillary thyroid carcinoma, 236 
Paracentric inversion, 126 
Paracrine factors, 199 

Paracrine signaling molecules, 199-203, 203/ 
Paralogs, 208,214 

Parkinson disease, 253t-254f, 262, 283, 
288/7-289/7 

autosomal recessive early onset, 175t-176t 
familial, 175f-176f 
Partial trisomy, 119, 119/ 

Patau syndrome, 113-114, 113/ 127-128 
PATCHED gene, 199 
Paternal age effect, 39-40, 40/ 

Pattern formation, 198-199, 205-215 
PAX6 gene, 197, 198/ 

Pearson syndrome, 93 

Pectus carinatum, Marfan syndrome and, 75 
Pectus excavatum, Marfan syndrome and, 75 
Pedigree, 63, 63/ 311b 
Pelizaeus-Merzbacher disease, 88f 
Penetrance 

age-dependent, 69 
reduced, 68-69 

Penicillamine, as teratogen, 315r 
Percutaneous umbilical blood sampling (PUBS), 
278 
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Pericentric inversion, 126, 127/ 

Peripheral neuropathy, 141 

Peroxisomal enzymes, 144 

Peroxisome biogenesis disorders (PBDs), 144 

Personal genomics, 296b 

Peutz-Jeghers syndrome, 256 

Pfeiffer syndrome, 200b-202b, 202t 

Phagocytes, 180 

Pharmacogenetics, 293 

Pharmacogenomics, 293 

Phenotype, 60--63 

Phenylalanine, metabolism of, 133/ 138 
Phenylalanine hydroxylase (PAH), 138 
Phenylketonuria (PKU), 2t, 60-63, 133, 134f, 
138-139, 170-171, 175M76f, 265-268, 

266^7, 296^,314 

dietary management of, 139^7, 139/ 139t 
genetic drift and, 57-58 
heterozygote screening for, 269f 
inheritance of, 1-3 
neonatal screening for, 266^, 269f 
Phenytoin, as teratogen, 315f 
Pheochromocytoma, neurofibromatosis and, 73 
Philadelphia chromosome, 128-129, 129/, 225 
Photosensitivity, 38f 
Physical mapping, 154, 166-172 
candidate genes in, 171-172 
chromosome morphology in, 166 
computer analysis in, 168-170 
deletions and, 166-167, 167/ 
dosage mapping and, 167 
functional vs. nonfunctional DNA in, 168 
gene expression testing in, 170-171, 170/-171/ 
nonhuman genes and, 169-170 
positional cloning and, 167-168 
screening for mutations in, 170 
translocations and, 167 
Pierre Robin sequence, 313 
Plasma cells, 182 
Plasmid, 44h-45^ 

Pleiotropy, 74 

Plexiform neurofibromas, 72 
PMP22 gene, 29 

Pneumonia, sickle cell disease and, 33 
Pneumonitis, 19117 
Point mutation, 35 
Polar body, 25, 25/ 
diagnosis, 281 
Polarity, 198-199 
Poly-A tail, 12 

Polycystic kidney disease, 175f-176f, 211l7-212l7 
adult, 2t 

locus heterogeneity and, 72 
Polycystin-2, 21117-21217 
Polydactyly, 57, 313 
postaxial, 63-64, 64/ 

inheritance patterns of, 67 
pedigree illustration of, 64f 
Polygenic genes, 239 
Polyhydramnios, 279b 
Polymerase chain reaction (PCR) 

DNA amplification using, 48-50, 49/ 
in mutation detection, 53f 
Polymorphisms, 137 
definition of, 28 
detection of, 42 
in restriction site, 45-46 
of tandem repeat, 45-46 


Polypeptides, 15 

ribosomal synthesis of, 16, 17/ 

Polyploidy, 108-109 
Polyposis, familial, 30 
Polyposis coli, familial, 175r-176f 
Pompe disease, 137-138, l3St 
Population genetics, 1-3 
Population screening, 265-271 
Populations 

association in, 164-166 
genetic variation in, 53-58 
Positional candidate approach, 171-172 
Positional cloning, 167-168 

Human Genome Project and, 169^ 

Positive predictive value, 267, 267f 
Postaxial polydactyly, 63-64, 64/ 
inheritance patterns of, 67 
pedigree illustration of, 64/ 

Posterior probability, 305 
Posttranslational modification, 18 
of collagen type I, 19 
Potter phenotype, 313-314, 314/ 

Potter syndrome, 313-314 
Prader-Willi syndrome, 93-95, 94/, 95b, 105, 
120-123, 166-167, 272t,312t 
microdeletion in, 123t 
unilateral disomy and, 125-126 
pRb, 70, 229 

familial melanoma and, 235 
Preaxial defects, 315f 
Precancerous colon polyps, 271 
Precision/predictive medicine 

cancer genomics and, 297-298, 297/ 
common disease and, 296-298 
definition of, 292 
future of, 299 
genetics and, 292-300 
implementation of, 292 
pharmacogenetics in, 293 
race and genetic assessment of individual 
ancestry in, 298-299, 299/ 
technology-driven transformation for, 292, 
293b 

Preconceptional counseling, 314 
Pregnancy 

acute fatty liver of, 141 
immunologic challenge of, 191^ 

Pregnancy loss, chromosome abnormalities and, 
117-118 

Pregnancy termination, 317 
Preimplantation genetic diagnosis (PGD), 281 
ethical debates in, 318 
Premature labor and delivery, 315f 
Prenatal diagnosis, 275-281 

amniocentesis in, 275-277, 216b-211b, 276/ 
Down syndrome and, 280 
early, 276-277 

inborn errors of metabolism and, 278, 

27St 

bioethics and, 317 

chorionic villus sampling in, 277-278, 277/ 
inborn errors of metabolism and, 278, 278f 
with fetal DNA in maternal circulation, 281 
fetal treatment and, 282 
of genetic disorders, 266 
of inborn errors of metabolism, 278, 278f 
maternal serum screening in, 275, 279-280, 
280/ 


Prenatal diagnosis (Continued) 

percutaneous umbilical blood sampling in, 
278 

preimplantation, 281 
ultrasonography in, 279-280, 279b, 279/ 
Pressure palsies, hereditary neuropathy with, 

29 

Primaquine, 294 

Primary care practitioner, 307-308 
Primary care provider, 311 
Primary ciliary dyskinesias, 2\\b~2\2b 
Primary transcript, 12 
splicing of, 15, 15/ 

Primer extension, 49 
Primers 

in DNA sequencing, 50 
in polymerase chain reaction, 48-49, 49/ 
Primitive streak, 206, 207/ 208 
Principle of independent assortment, 60, 154, 
155/ 

Principle of segregation, 60 
Prior probability, 305 
Probability 

basic concepts of, 54-55 
Bayesian analysis and, 305^-306^ 

Proband, 63 
Probe(s) 

in DNA microarrays, 53 

in fluorescence in situ hybridization, 105, 

106/ 

in Southern blotting, 42-45 
Progress zone, 213-214, 214/ 

Progressive myoclonic epilepsy type 1, 

98t~99t 

Prokaryotes, 15 

intron-exon structure of, 20 
Promoter, 12 
Promoter mutation, 29 
Proofreading, in DNA replication, 8 
Prophase, 23, 24/ 

Prophase I, 23, 25/ 

Prophase II, 26 
Propionic acidemia, 134r 
Propositus, 63 

Prostate cancer, 248h, 256, 274b, 298 
Protanomaly, 86, 87/ 

Protanopia, 86, 86/-87/ 

Protein electrophoresis, 41-42, 42/ 

Protein kinases, 219-220, 219/ 

Protein synthesis, 11, 11/ 
cell cycle and, 22 

posttranslational modification in, 18 
Protein truncation, 53f 

Protein tyrosine phosphatase, non-receptor type 
11 (PTPNll) gene, 202-203 
Proto-oncogenes, 223 
Pseudoachondroplasia, 98f-99f 
Pseudoautosomal region, 116, 116/ 

Pseudogenes, 188-189 
Pseudomosaicism, 276 
Psoriasis, 192f, 274b 

Psychiatric disorders, 247, 261-262, 261t 

PTC gene, 199 

PTCH gene, 222t 

PTEN gene, 222t 

Pulmonary disease, 61 

Pulmonary hypertension, 112 

Pulmonary hypoplasia, 313-314 




Punnett square, 28, 29/ 

for autosomal dominant inheritance, 63, 64/ 
for autosomal recessive inheritance, 64-65, 
65/ 

Purine nucleoside phosphorylase deficiency, 
I95t 

Purines, 6, 7/ 

Pyloric stenosis, 2f, 240-241, 240/ 241t, 247, 
247f, 262-263 

Pyrimidine dimers, 35, 36/ 
removal of, 37b-38b 
Pyrimidines, 6 

Pyruvate carboxylase deficiency, 134f 
Pyruvate dehydrogenase complex (El) 
deficiency, 134t 

Q 

q arm of chromosome. See Long arm (q), of 
chromosome 
Quadruple screen, 280 
Quasidominant inheritance, 66 
Quinacrine banding (Q-banding), 104 

R 

Race, and precision medicine, 298-299, 299/ 

Radial aplasia, 112-113 

Radiation 

as mutagen, 35 
mutation rates due to, 37^ 

Radical fringe (r-Fng) gene, 213 
Radon, 37 
RAG genes, 187 
SCID and, 194 
Random mating, 56 
RAS oncogene, 224-225 
RAS protein, 228 
RBI gene, 221/ 222, 222t 
mutation, 221/ 
reduced penetrance of, 70 
Recessive allele, 28 
Recessive diseases 

compound heterozygote in, 36/ 
loss-of-function mutations in, 31-32 
Reciprocal translocations, 118-119, 119/ 
Recombinant DNA, 44l7-45l7, 44/ 
Recombinases, 187 
Recombination, 154, 155/-156/ 
somatic, 187, 187/ 

Recombination frequency, 154-155 
pedigrees and, 156, 157/ 

Recombination hot spots, 156-157 
Recurrence risks 

for autosomal dominant disease, 64 
for autosomal recessive disease, 66 
definition of, 64 

in genetic counseling, 304-305, 305/7-306/?, 
305/-306/, 306r, 307 

for multifactorial diseases, 241-243, 24 U, 
243t 

Red-green color blindness, 81, 86 
Reduced penetrance, 68-69 
Reduction division stage, 23 
Refsum disease, 144 
Regulatory T cells, 184, 192 
Renal agenesis, 279/?, 313-314 
Renal dysgenesis, 315t 
Renal failure, 149 
Renal insufficiency, 136-137 
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Renin-angiotensin-aldosterone system, 255/ 
Repeat expansions 

diseases associated with, 97-99, 98t-99f 
in Huntington disease, 71 
in myotonic dystrophy, 96-99 
Repetitive DNA, 21 
Replication bubbles, 9-11, 10/ 

Replication origins, 9-11, 10/ 

Restriction digest, 42 
Restriction endonucleases, 42 
Restriction enzymes, 42, 43/ 

in recombinant DNA technology, 44/ 
Restriction fragment analysis, 42-45 
Restriction fragment length polymorphisms 
(RFLPs), 45, 48/ 

Restriction sites, 42 
polymorphisms in, 45-46 
RET proto-oncogene, 236-237 

Hirschsprung disease and, 204/?, 247 
Retinal disease, 141 

Retinitis pigmentosa, 161, 163/?, 163/ 175f-176f, 
301-302 

Retinoblastoma, 2t, 68-69, 69/-70/ 70/?, 129f, 
166-167, 175M76r, 217-218, 220-221, 223, 
226, 230. See also RBI gene 
somatic, 221/ 222, 222t 
Retinoic acid, 247 
Retinoschisis, juvenile, 88t 
Retroviral vectors, for gene therapy, 282-283, 
283/ 

Retroviruses, 223-224 
Rett syndrome, 90, 175M76t, 312f 
Reverse banding (R-banding), 104 
Reverse transcriptase 
of retroviruses, 223-224 
telomerase as, 226 
Reye syndrome, 132, 148 
RFLPs. See Restriction fragment length 
polymorphisms (RFLPs) 

Rh system, 41, 193 
Rheumatic fever, 191-192 
Rheumatoid arthritis, 192t, 298 
Rhizomelic chondrodysplasia punctata type 1, 
144 

Ribonucleic acid (RNA). See RNA (ribonucleic 
acid) 

Ribosomal RNA (rRNA), 16 
Ribosome, 16 

Ribozyme therapy, 286/ 287 
Rickets 

hypophosphatemic, 90 
vitamin D-resistant, 88f 
Ring chromosomes, 126, 126/ 

Risk estimation. See Recurrence risks 
RNA (ribonucleic acid), 11 
RNA interference, 287, 287/ 

RNA polymerase II, 12, 12/-13/ 

RNA sequencing, in gene expression testing, 

I7I, 171/ 

Robertsonian translocations, 119-120, 119/ 121/ 

Rod cells, 86 

Rod monochromacy, 86 

Romano-Ward syndrome, 252 

rRNA (ribosomal RNA), 16 

RTK-MAPK signaling pathway, 202-203, 203/ 

Rubinstein-Taybi syndrome, 123t, 213, 3l2t 

Runx2 gene, 215 

Russell-Silver syndrome, 126 


s 

S (synthesis) phase, 22, 22/ 

Saethre-Chotzen syndrome, 14f 
Sandhoff disease, 146f 
Sanfilippo syndrome, 145f 
Sarcomas, 217 

osteosarcomas, 229 
soft-tissue, 229 
Satellite DNA, 21, 22/ 

Satellites of acrocentric chromosomes, 103, 

104/ 

Scalp defects, 315t 
Scheie syndrome, 145 
Schindler disease, 146f 
Schinzel-Giedion syndrome, 174 
Schizophrenia, 2t, 174, 245-246, 245r, 248/?, 
261-262 

SCID. See Severe combined immune deficiency 
(SCID) 

Scoliosis, 308/? 

Marfan syndrome and, 75 
neurofibromatosis and, 72-73 
Second-degree relatives, 63 
Segregation, principle of, 60 
Seizures, 268-269 
Selection coefficient, 58 
Sense strand, 12, 12f 
Sensitivity, of test, 265-267, 267f 
Sensorineural hearing loss, 112, 112/? 

Sepsis, 135 
Sequence, 313, 314/ 

Sequence-tagged sites (STSs), 169b 
Serious adverse drug reaction (SADR), 293 
Severe combined immune deficiency (SCID), 
194, 195f, 282 

gene therapy for, 284/?, 284t 
Sex chromosome aneuploidy, 114-117 
Sex chromosomes, 6. See also X chromosome; 

Y chromosome 
Sex determination, 116/? 

Sex selection, 318 
Sex-influenced traits, 91 
Sex-limited traits, 91 

Sex-linked inheritance, 79-102, 101/ See also 
X-linked dominant inheritance; X-linked 
recessive inheritance; Y-linked inheritance 
Sex-linked mutations, 79 
Sherman paradox, 100-101, 100/ 

Shh gene, 199, 200/ 206, 213-214 
Short arm (p), of chromosome, 103 
Short interspersed elements (SINEs), 21-22 
Short tandem repeats (STR), 45-46 
DNA profiles based on, 47/ 
as markers, 159 
Shprintzen syndrome, 124 
Sib-pair method, 248/? 

Sickle cell disease, 2r, 3, 33, 33t, 34/ 49, 53-54, 
58, 167-168, 175M76f, 266/?, 268, 272-273, 
272f, 296b 

allele-specific oligonucleotide probe in, 273/ 
clinical impact of, 4 
heterozygotes and, 56-57 
screening for, 269t 
in homozygotes, 67 
inheritance of, 1-3 
malaria and, 56-57, 57/ 
newborn screening for, 269t 
protein electrophoresis for, 41, 42/ 
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Sickle cell disease (Continued) 

Southern blotting in, 45, 46/ 
treatment of, 35 

SIDS. See Sudden infant death syndrome 
(SIDS) 

Signal transduction, 219-220, 219/ 

RTK/Ras GTPase/MAPK (RTK-MAPK) 
signaling pathway, 202-203, 203/ 
Silencers, 13, 13/ 

Silent substitutions, 28-29 
Silver-Russell syndrome, 96, 96/ 

SINE (short interspersed elements), 21-22 
Single-copy DNA, 21, 22/ 

Single-gene diseases, 239-241, 243 
variable expression of, 72 
Single-gene disorders 

childhood deaths attributable to, 4t 
prevalence of some, 2t 
Single-gene inheritance, vs. multifactorial 
diseases, 243-244, 244/ 

Single-gene mutations, parental age and, 39-40, 
40/ 

Single-gene traits, 60 

Single nucleotide polymorphisms (SNPs), 46, 
48/ 156 

ancestry and, 298, 299/ 
consumer services providing, 293b 
in genome-wide association studies, 172-173, 
172/-173/ 

in Human Genome Project, I69b 
in linkage disequilibrium, 173, 173/ 

Single peroxisomal enzyme deficiencies (PEDs), 

144 

Sister chromatid exchange, 22 
Sister chromatids, 22, 24/ 26/ 

Situs ambiguus, 212/ 

Situs inversus, 212/ 

Situs solitus, 212f 
Skeletal defects, 200b-202b 
Skeletal formation, 215 
Skeletal myopathy, 141 
Skin cancer, from UV radiation, 35 
Skin tumors, 37-38, 38f 
Skull ossification defects, 315f 
Sly syndrome, 145t 
SMAD4 gene, 222r, 230 
Smallpox, I9lb 

Smith-Lemli-Opitz syndrome, 134t, 141-142, 
142/ 206 

Smith-Magenis syndrome, 106/ 123r 
Smo gene, 199 

Smoking, and tti-antitrypsin deficiency, 273 
SNPs. See Single nucleotide polymorphisms 
(SNPs) 

Solenoid, 7-8, 9/ 

Soluble carrier family 3, member 1 amino acid 
transporter (SLC3A1), 149 
Solvent abuse, as teratogen, 315r 
Somatic cell, 6 

gene therapy, for familial 

hypercholesterolemia, 250b-252b 
Somatic cell nuclear transfer (SCNT), 288-289, 
2SSb-2S9b, 289/ 

Somatic hypermutation, 182, 187-188 
Somatic mutations, 28 
from radiation, 37b 
Somatic recombination, 187, 187/ 

Sonic hedgehog (Shh), 199, 200/ 


Southern blotting, 42-45, 43/ 
autoradiogram in, 42-45, 45/ 
in mutation detection, 53t 
in sickle cell disease, 45, 46/ 

Southern transfer, 42 
SOX genes, 204, 204^? 

Spatial colinearity, 208, 209/ 

Specific transcription factors, 13, 13/ 
Specificity, of test, 265-267, 267t 
Spectral karyotyping, 105-107, 107/ 
Spemann-Mangold organizer, 206 
Sperm cells, 26 
Spermatids, 25/ 26 
Spermatocytes, 25/ 26 
Spermatogenesis, 25/ 26 
Spermatogonia, 23 
Sphingolipidoses, 145-147, 146f 
Spina bifida, 2r, 112-113, 206, 242^7-243^?, 242/ 
245f, 275, 279-280, 280/ 315r, 317/?. See 
also Neural tube defects (NTDs) 
surgical closure of, 282 
Spinal and bulbar muscular atrophy, 98t-99f, 
143-144 

Spindle fibers, 23, 24/ 26 
Spinocerebellar ataxias, 97-99, 98r-99r 
Splenomegaly, 33-35, 33f, 35/ 147 
Splice-site mutations, 29-30, 31/ 

Splicing, gene, 15, 15/ 

Spontaneous abortion, 315r 

Spontaneous mutations, 35 

SRYgene, 116, 116/204 

Stalks, of acrocentric chromosomes, 104/ 

Staphylococcal skin disease, 301 

Stem cell(s) 

bone marrow, gene therapy using, 282, 

284/? 

embryonic, 289, 318 

of mouse, 209b-211b, 210/ 

Stenosis, neurofibromatosis and, 73 

Stent thrombosis, 294-295 

Steroid hormone receptors, defects of, 

143-144 

Steroid hormones, synthesis of, 142-143, 143/ 
Stomach cancer, 219 
Stop codons, 15, 16f 

mutation causing, 28-29, 30/ 

STR. See Short tandem repeats (STR) 
Strabismus, 65-66, 112, 112/? 

Streptomycin, as teratogen, 315f 
Stroke, 2t, 247-249, 252-254, 257-258 
sickle cell disease and, 33, 45 
Subdural hematoma, Huntington disease and, 
71 

Submetacentric chromosome, 103, 104/ 
Substitution, base-pair, 28-29, 30/ 
Subtelomeric rearrangements, 123-125, 125/ 
Succinylcholine, 294 
Sudden infant death syndrome (SIDS), 
metabolic defect and, 132 
Sulfonylureas, 294t 
Superoxide dismutase-1 (SOD-1), 167 
Supravalvular aortic stenosis (SVAS), 122, 

122 / 

Synapsis, 23 
Syndrome 

definition of, 313 

isolated malformations and, 313 

sequence and, 314 


Synostosis, 213 
Synpolydactyly, 98f-99f, 214 
Syntenic loci, 155 

Systemic lupus erythematosus, 192f, 314 

T 

T lymphocytes, 180-182, 181/ 185/ 
cytotoxic, 180-181 
helper, 180-181, 181/ 

Talipes equinovarus, 2t 
Tandem mass spectrometry, in newborn 
screening, 268 

Tandem repeats, 21. See also Variable number of 
tandem repeats (VNTR) 
expanded, 30-31. See also Repeat expansions 
polymorphisms of, 45-46, 46/ 

Tangier disease, 253t-254r 
TAP genes, 189-190, 194 
Targeted disruption, 209b-2llb 
Tarui disease, 136/ 138f 

Tay-Sachs disease, 2t, 146r, 175f-176r, 265, 266/?, 
268-269, 269t, 271, 278t, 281, 311/? 
genetic drift and, 57-58 
T-box, 213 

T-cell immunodeficiency diseases, 194 
T-cell receptors, 188, 188/ 
genes for, 190 

major histocompatibility complex and, 190 
Telangiectases, 38f 
Telomerase, 226 
Telomeres, 103 

of chromosomes, 226 
recombination frequencies near, 156-157 
Telophase, 23, 24/ 

Telophase 1, 25, 25/ 

Telophase II, 26 

Template, in DNA replication, 8 
Temporal colinearity, 208, 209/ 

Teratogens, 314, 315f 
Bendectin and, 316/? 

Teratology, 312-314. See also Dysmorphology 

Terminal deletion, 120 

Termination sequence, 12 

Testicular atrophy, 96 

Testicular feminization syndrome, 143 

Tetracycline, as teratogen, 315f 

Tetrad chromosomes, 23 

Tetralogy of Fallot, 308/? 

Tetraploidy, 108 

TGF-p. See Transforming growth factor p 
(TGF-p) 

Thalassemia, 2t, 32-35, 266b, 285 
a-Thalassemia, 33-35, 175f-176f 
heterozygote screening for, 269t 
p-Thalassemia, 30, 33-35, 33t, 35/ 175M76f, 
281,284r 

autosomal dominant, 67 
heterozygote screening for, 269, 269t 
treatment of, 35 
Thalidomide, 247 
as teratogen, 315f 

Thanatophoric dysplasia, 200b-202b, 207-208, 
279b 

Thiopurine methyl transferase (TPMT), 293, 
294t 

Thiouracil, as teratogen, 315f 
Threshold model, of multifactorial disorders, 
239-241 




Thrombocytopenia, 191^ 

Thymine, 6, 7/-8/ 

Thyroid carcinoma, 37, 224t 
medullary, 236 
papillary, 236 

Tibial bowing, neurofibromatosis and, 73 
Tibial pseudarthrosis, neurofibromatosis and, 

73 

Tissue-specific mosaicism, 112 
Toll-like receptors, 180 
Torsion dystonia, genetic drift and, 57-58 
TP53 gene, 2226 228-229 
Trace elements. See Heavy metal transport 
defects 

Transcription, 11/-12/, 12 
of mitochondrial DNA, 92 
regulation of, 12-15 

Transcription activator-like effector nucleases 
(TALENs), 209 / 7 - 211/7 
Transcription factors, 14 
cell growth and, 219/ 
diabetes mellitus and, 258 
DNA-binding motifs of, 14, 14/, 14f 
in embryonic development, 203-204, 
211 / 7 - 212 / 7,214 
general, 13, 13/ 
imprinting and, 96 
specific, 13, 13/ 

Transduction, 282 
Transfer RNA (tRNA), 15-16, 16/ 
mitochondrial DNA coding for, 92 
Transferase deficiency galactosemia, 133-135 
Transforming growth factor |3 (TGF-p), 199 
cystic fibrosis and, 72 
Marfan syndrome and, 76 
Transgenic mouse, 209/7-211/7, 210/ 

Translation, 11, 11/ 15-18, 17/ 

Translocations, 118-120, 119/ 121/ 
in cancer, 225 
physical mapping and, 167 
Transmission patterns, 241-243 
Trichromatic vision, 86 
Tricyclic antidepressants, 294-295, 295/ 
Trimethadione, as teratogen, 315f 
Triploidy, 108 
Trisomy, 109, 109/ 
maternal age and, 114 
partial, 119, 119/ 

Trisomy 13, 2t, 113-114, 113/ 118t, 312t, 313, 
317 

prenatal diagnosis of, 280 
Trisomy 14, 120, 121/ 

Trisomy 18,26 112-113, 112/7, 113/ 1186 127, 
3l2t 

birth of child with, 308/7 
prenatal diagnosis of, 280 
Trisomy 21, 26 3, 110-113, 111/ 1186 120, 121/ 
127-128, 277 / 7 , 280-281, 280/ 302-303, 
308/7, 310 / 7 , 311, 312f 

anticipatory guidance and health supervision 
in, 112/7 

prenatal diagnosis of, 280 

prevalence of, 114, 114f 

talking to parents of newborn with, 304/7, 

307 

Trisomy X, 117 

tRNA. See Transfer RNA (tRNA) 

True negatives, 265-267, 267f 
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True positives, 265-267, 267f 
TSC2 gene, 222f 
Tuberculosis, 294, 298 
Tuberous sclerosis, 222t 
Tuberous sclerosis type 1, 175f-176t 
Tuberous sclerosis type 2, 175f-176t 
Tumor suppressor genes, 222-223, 222f 
compared with oncogenes, 224r 
identification of, 226 
Tumorigenesis, 217, 225 
Tumors, 217 
brain, 229 
breast, 227/ 
cells, 225-226, 231/7 
colon, 230 

deleted chromosome regions in, 226 
human, 228-229 
malignant, 232/ 

MEN2B, 236 
monoclonal, 217 
multiple, 221/ 
primary, 229 
skin, 225 

retinoblastoma, 217-218 

solid, cytogenetic changes in, 129t 

sporadic, 223 

suppressor genes, 222-223, 227/ 231/ 
supressor mutations, 226 
Wilms, 226 

Turner syndrome, 26 3, 112, 112/7, 114-115, 
115/ 127-128, 312f 
X-linked recessive diseases and, 90 
22qll deletion syndrome, 108/ 124, 124/ 
Twin studies, 1, 244-246, 244/ 245f 
Two-hit model of carcinogenesis, 220 
Type 1 diabetes, 274/7 
Type 2 diabetes, 274b 
Tyrosinase-negative albinism, 65-66, 65/ 
Tyrosinase-negative oculocutaneous albinism, 
133/ 

Tyrosine, metabolism of, 138-140 
Tyrosinemia, 133/ 266/?, 268 
type 1, 134t 
type 2, 140 
type 3, 140 

u 

Ulnar-mammary syndrome, 146 214, 214/ 
Ultrasonography, prenatal, 279, 279b, 279/ 
nuchal translucency in, 280 
Ultraviolet radiation, mutations from, 35, 
37 / 7 - 38 / 7 , 38/ 

Umbilical blood sampling, percutaneous, 278 
Unbalanced rearrangements, of chromosome, 
118, 118r 

Underossified skull, 315f 
Uninformative mating, 159-160, 161/ 
Uniparental disomy, 94-95, 125-126, 125/ 
Upstream DNA sequences, 8 
Uracil, 11, 12/ 

Urea cycle, 147, 147/ 

defects of, 1346 139, 147-148 
Uridine diphosphate, 136/ 

Uridine diphosphate (UDP)-galactose-4- 
epimerase, 135-136, 136/ 

Urinary obstruction, 317/7 
Usher syndrome, 163/7 
Uterine abnormalities, 315f 


V 

Vaginal adenocarcinoma, 315f 
Vaginal adenosis, 315f 
Validity, of test, 265-267 
Valproic acid, as teratogen, 315f 
van der Woude syndrome, 313 
Variable expression, 69-72, 72b-73b 
Variable number of tandem repeats (VNTR), 
45-46 

Variable region, of immunoglobulin chain, 186, 
186/ 

Vascular endothelial growth factor (VEGF), in 
gene therapy, 288 
VATER association, 312f 
VDJ recombination, 187, 187/ 

Vectors 

for cloning, 44/7-45/7 
gene therapy, 285 
in Human Genome Project, 169/7 
Velocardiofacial syndrome, 123, 1236 124/7 
Ventricular septal defects (VSDs), 112-113, 128, 
241 

Very long-chain acyl-CoA dehydrogenase 
(VLCAD) deficiency, 134t 
VHL gene, 222f 
Viral gene therapy, 285 
Visceromegaly, 145-147 
Visual acuity, reduced, 65-66 
Vitamin D-resistant rickets, 88r 
Vomiting, 3l6b 
von Gierke disease, 136/ 138f 
von Hippel-Lindau syndrome, 2226 230, 237 
von Recklinghausen disease, 72 
von Willebrand disease, 83, 175M76t 

w 

Waardenburg-Shah syndrome, 204/7, 205/ 
Waardenburg syndrome, 169-170 
Warfarin 

polymorphisms and response to, 2946 
295-296 

as teratogen, 315f 
Werner syndrome, 38 1 
Whole-genome sequencing, 173-176 
Williams syndrome, 105, 112, 112/?, 122-123, 
122/ 1236 2726 312r 

Wdms tumor, 95, 1296 166-167, 2226 226, 302b 
in WAGR syndrome, 122-123, 123r 
Wilson disease, 1346 149, 152, I75t-I76t 
Menkes disease and, 151 
Wingless (Wnt) family, 199, 213 
Wiskott-Aldrich syndrome, 194, 1956 284/7, 

284f 

Wnt signal transduction pathway, 230-233 
Wolf-Hirschhorn syndrome, 120, 122-123, 122/ 
WTl gene, 222t 

X 

X chromosome, 6 

monosomy of, 114-115, 115/ 

X inactivation, 79-81, 80/ 

X inactivation center, 80-81 
45,X syndrome, 1 18t 
Xanthomas, 250/7-252/7, 250/ 

Xeroderma pigmentosum, 37b-3Sb, 38/ 2226 
225 

XIST gene, 81 

X-linked adrenoleukodystrophy, 144, 284r 
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X-linked agammaglobulinemia (XLA), 194, 
195f 

X-linked diseases, prevalence of, 4f 
X-linked dominant inheritance, 90-91, 91/ 
vs. X-linked recessive pattern, 911 
X-linked ocular albinism, 79-80, 80/ 
X-linked recessive disorders, 83, 144 
X-linked recessive inheritance, 81-90 
disorders with, 88t 
females affected by, 88 
pedigree for, 89/ 

Punnett square for, 89/-90/ 

vs. X-linked dominant pattern, 9If 


X-linked severe combined immunodeficiency, 
282, 284b, 284f, 288 
XPA gene, 222f 
XX males, 116b, 116/ 

47,XXX syndrome, 118f 
47,XXY syndrome, 118f 
XY females, 116b, 116/ 

47,XYY syndrome, 117, 118f 

Y 

Y chromosome, 6, 79 

Yeast artificial chromosomes, 44^-45^, 169^ 
Y-linked inheritance, 91, 91/ 


Z 

Zap70 kinase deficiency, 195f 
Zellweger syndrome, 144, 278f 
Zinc, 152 
Zinc finger, 14t 

Zinc-finger nucleases (ZFNs), 2Q9b-211b 
Zinc-finger protein of the cerebellum, 

211b-212b 
Zinc ion, 149 

Zone of polarizing activity (ZPA), 213-214, 214/ 
Zygote, 22 
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